General features of the X. campestris pv. vesicatoria genome.
The
X. campestris pv. vesicatoria strain 85-10 genome sequence was established by employing a whole-genome shotgun approach. In the final consensus sequence, each base matched at least phred40 quality. The assembly of the high-quality sequence was validated by a complete fosmid map (Fig.
1).
X. campestris pv. vesicatoria contains a single, circular chromosome of 5,178,466 bp (Fig.
1) and four extrachromosomal elements (Table
1). While the presence of plasmids pXCV2 (1,852 bp), pXCV38 (38,116 bp), and pXCV183 (182,572 bp) was known previously (unpublished data), the genome project revealed the existence of a fourth plasmid, which was designated pXCV19 (19,146 bp) (Table
1).
The G+C content averages 64.75% for the chromosome and varies between 56.59% and 60.73% for the plasmids. The origin of replication is clearly detectable by a bias of G toward the leading strand (GC skew) (
76). The
dnaA gene is located in this region, and, therefore, its start codon has been defined as the zero point of the chromosome (Fig.
1). The GC skew analysis of the chromosome indicates a bidirectional replication mechanism, but the chromosome is not clearly divided into two equal replichores (Fig.
1A). The location of the predicted replication terminus appears to be skewed from the 180° position opposite to
oriC, resulting in an approximately 370-kb size difference between the replichores.
X. campestris pv. vesicatoria contains a total of 4,726 predicted coding sequences (CDS) (Table
1). The chromosome shows an average coding capacity of 87.13%, which is typical for most bacteria. There is no significant asymmetry in the distribution of CDS on the chromosome between the leading strand (2,223 CDS; 49.5%) and the lagging strand (2,264 CDS; 50.5%). Based on the manual annotation, biological roles were assigned to 3,080 of the 4,726 CDS. The remaining 1,646 CDS comprise 697 conserved hypothetical CDS and 949 CDS of unknown function (Fig.
1A; Table
1).
X. campestris pv. vesicatoria contains two rRNA operons organized in the order 16S-23S-5S and located in a region of approximately 500 kb (between 4,600,000 bp and 5,100,000 bp) on the left replichore (Fig.
1B). Altogether, 56 genes for tRNAs representing all 20 amino acids were identified. A total of 54 tRNA genes are located on the chromosome (Fig.
1B), whereas two genes for tRNAs both recognizing the AUA codon are exclusively carried on the plasmids pXCV19 and pXCV183 (Table
1).
Horizontal gene transfer and genome plasticity.
The genome of
X. campestris pv. vesicatoria contains 66 IS elements. A total of 58 IS elements are located on the chromosome, and 8 are located on the four plasmids (Table
1; see supplementary Table
1 at
https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html ). A total of 48 elements belong to the IS
3 family, 10 elements belong to the IS
5 family, and 4 elements belong to the IS
6 family. Four IS elements could not be grouped with a known family. In
X. campestris pv. vesicatoria, the IS
3 family member IS
476/IS
1477 (20 copies) is the most abundant element. IS
476 has been characterized in
X. campestris pv. vesicatoria and shown to inactivate the avirulence gene
avrBs1 (
50). The IS
3 family is also highly abundant in
X. axonopodis pv. citri, whereas in
X. campestris pv. campestris, most IS elements belong to the IS
3 and IS
5 families.
An indication of high genomic plasticity is the presence of atypical DNA regions, for example, regions that differ in their G+C contents from the average of the genome. Most likely, these regions were acquired by horizontal gene transfer (
31). The chromosomal CDS range in their G+C contents from 40.1% to 75.1%, with an average of 65.1%. More than 85% of the CDS have a G+C content between 60% and 70%. Analysis of the 30 CDS with the lowest G+C contents (40.1% to 50.4%) revealed that 50% are ORFans, i.e., genes that are restricted to a particular genome and that possess no known homologs. With one exception, these ORFans are short (average length, 475 bp). A total of 21 ORFans are found in the vicinity of IS elements. In contrast, there are no ORFans among the 30 CDS with the highest G+C contents (72.3% to 75.1%). The small size and low G+C content of the ORFans in
X. campestris pv. vesicatoria are similar to observations made in
Escherichia coli (
28).
To identify alien nucleotide sequences, the chromosome was analyzed for regions with significant deviations from the genomic mean in G+C content (see Materials and Methods). These regions vary in size between 2 kb and 26 kb and contain 50 out of 58 IS elements and 15 tRNA genes. The presence of IS elements and tRNA genes is typical for genomic islands (
31). Interestingly, the two largest regions (3,870,000 to 3,899,000 bp and 3,190,000 to 3,215,000 bp) carry parts of a type IV secretion system (
virB6 or
virB9,
virB8, and
virD4) and components of the type 4 pilus (i.e.,
pilE and
fimT). The location of parts of the type IV apparatus within a genomic region of atypical G+C content, an indicator for horizontal gene transfer, has also been observed for
Wolinella succinogenes, a member of the ε-proteobacteria (
3). Interestingly, another ∼15-kb atypical region carries the complete xanthan biosynthesis cluster
gumA to
gumM and is flanked on one side by IS
1477. An ∼14-kb region harboring genes with homology to the filamentous phage φLf (∼2,784,000 to 2,798,000 bp) could also be identified by this approach. Additionally, smaller atypical regions encode putative virulence determinants, such as type III effector proteins and adhesins (see below).
In summary, features such as the high number of mobile elements and deviations in G+C content suggest a highly flexible genome. This is advantageous for pathogen evolution driven by the need for continuous adaptation to the host in order to evade or suppress coevolving host defense mechanisms. This idea is corroborated by the presence of a large number of virulence determinants (see below).
Comparison of the X. campestris pv. vesicatoria sequence to genomes of other plant-pathogenic bacteria.
Previously, the complete genomic sequences of three different xanthomonads,
X. axonopodis pv. citri,
X. campestris pv. campestris, and
X. oryzae pv. oryzae, causal agents of citrus canker, the systemic black rot disease of crucifers, and rice blight, respectively, were determined (
26,
53). All four xanthomonads have similar chromosome sizes; however, they differ in their plasmid contents.
X. campestris pv. vesicatoria harbors four plasmids, and
X. axonopodis pv. citri carries two plasmids, whereas
X. campestris pv. campestris and
X. oryzae pv. oryzae lack any plasmids. A comparison of the chromosomal sequences of the four xanthomonads revealed that
X. campestris pv. vesicatoria is most closely related to
X. axonopodis pv. citri (Fig.
1A; Fig.
2). Since the genome structure of
X. oryzae pv. oryzae is completely different from the three other xanthomonads (Fig.
2B), we focused our comparative analyses of the genomic organization on
X. axonopodis pv. citri and
X. campestris pv. campestris. Most genes are syntenic to each other among these three xanthomonads; however, striking differences are found between the flagellar gene cluster and the replication terminus of the chromosomes (Fig.
1A; Fig.
2A). Integration events in this region are probably the reason for the asymmetric replichores (see above).
The DNA sequences of
X. campestris pv. vesicatoria and
X. campestris pv. campestris chromosomes differ in the locations of some large gene clusters (Fig.
2A). For instance, two gene clusters coding for different type II secretion systems and their flanking regions are colinear between
X. campestris pv. vesicatoria and
X. axonopodis pv. citri, whereas their positions are exchanged in the chromosome of
X. campestris pv. campestris compared to the other two xanthomonads (Fig.
2A). The same is true for the chromosomal location of the
hrp gene cluster, which is found at the same position in
X. campestris pv. vesicatoria and
X. axonopodis pv. citri but at a different location in
X. campestris pv. campestris (Fig.
2A). Thus, the chromosomes of
X. campestris pv. vesicatoria and
X. axonopodis pv. citri are not completely colinear with
X. campestris pv. campestris (Fig.
2A).
Using a cutoff E value of 10−30, comparisons of the predicted protein sequences of the four xanthomonads revealed that 2,999 CDS (66.8%) of the X. campestris pv. vesicatoria chromosome are orthologs of genes in X. axonopodis pv. citri, X. campestris pv. campestris, and X. oryzae pv. oryzae, thus representing the conserved chromosomal backbone of these four species. A total of 184 CDS (4.1%), 87 CDS (1.9%), and 45 CDS (1.0%) were considered orthologs of genes present in only X. axonopodis pv. citri, X. campestris pv. campestris, and X. oryzae pv. oryzae, respectively. A total of 548 CDS (12.2%) are unique to X. campestris pv. vesicatoria, most of which encode hypothetical and conserved hypothetical proteins.
We also compared X. campestris pv. vesicatoria with the following other plant-pathogenic bacteria: X. fastidiosa, P. syringae pv. tomato, R. solanacearum and E. carotovora subsp. atroseptica. Because of the low overall DNA sequence identity, we limited the comparisons to predicted protein sequences. Using the same cutoff E value as above, there are 1,714 CDS (38.2%) in X. campestris pv. vesicatoria which are homologous to predicted proteins of X. fastidiosa, 1,780 CDS (39.7%) of P. syringae pv. tomato, 1,600 CDS (35.7%) of R. solanacearum (chromosome and megaplasmid), and 1,434 CDS (32.0%) of E. carotovora subsp. atroseptica.
Four plasmids of X. campestris pv. vesicatoria strain 85-10.
Many
Xanthomonas strains carry plasmids, but little is known about their function and relevance for pathogenicity (
102).
X. campestris pv. vesicatoria strain 85-10 harbors four different plasmids (Table
1). The smallest plasmid, pXCV2, is basically identical to pXV64 of
X. campestris pv. vesicatoria strain Xv64 (
106). The presence of plasmid pXCV19 was detected only due to the efforts of the genome sequencing project and contains almost exclusively genes necessary for plasmid partitioning and conjugation. pXCV38 shows similarities to the
X. axonopodis pv. citri plasmid pXAC33; however, pXCV38 lacks the type III effector genes
pthA1 and
pthA2. Instead, pXCV38 contains nine genes encoding a putative type IV secretion system of the Vir/Tra type (XCVc0028 to XCVc0033, XCVc0035, XCVc0041, and XCVc0042) which probably serve for conjugal transfer.
Intriguingly, the largest plasmid in strain 85-10, pXCV183, encodes a putative type IV secretion system which is most similar to the Icm/Dot system of human pathogens (
21). We could identify 14 of the 22 components found in
Legionella pneumophila and
Coxiella burnetii (
21,
111) (see supplementary Table
2 at
https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html ). We alsofound candidate genes which could fulfill the function of the missing components. This is the first report of a putative Icm/Dot-like type IV secretion system in a plant-pathogenic bacterium. The essential role of Icm/Dot type IV secretion for the virulence of
Coxiella and
Legionella species (
21,
111) suggests that this system might contribute to the virulence of
Xanthomonas.
One interesting question concerns the segregational stability of the plasmids, which appears to be maintained by different mechanisms. pXCV183 contains a putative postsegregational killing system (Fig.
3) of the ζ toxin (XCVd0099) ε antitoxin (XCVd0100) type (
59), which is also present on pXAC64. Additionally, the presence of an isoleucin tRNA gene on plasmids pXCV183 and pXCV19, which is missing in the chromosome, might contribute to plasmid stability. This feature is unique for
X. campestris pv. vesicatoria relative to
X. axonopodis pv. citri.
Genetic information storage and processing.
All basic genes for information storage and processing (i.e., DNA replication, recombination, repair, transcription, and translation) are present. Interestingly, there are three genes encoding histone-like nuclear structuring proteins (H-NS). Mutations in these genes often have pleiotropic effects, and a regulatory function of these genes in virulence has been implicated (
80).
Most of the transcriptional regulators in
X. campestris pv. vesicatoria belong to one of the following families: LysR (40 genes), AraC (18 genes), TetR (17 genes), LacI (10 genes), and MarR (10 genes). In addition,
X. campestris pv. vesicatoria contains several putative two-component signal transduction systems. Approximately half of the 68 response regulators possess a predicted DNA binding motif as an output domain, while the rest contain other output domains, such as diguanylate cyclase (GGDEF domain), cyclic diguanylate (c-diGMP) phosphodiesterase (EAL domain), metal-dependent phosphohydrolase (HD domain), and CheB-like methylesterase. Altogether, 29 GGDEF and 14 EAL domains were found. These protein domains are responsible for the synthesis and turnover of the secondary messenger c-diGMP. Originally, this signal molecule was identified as an allosteric regulator of the cellulose synthase of
Glucoacetobacter xylinum (
86). In several bacteria, cellulose serves as an extracellular matrix involved in biofilm formation (
84). Interestingly,
X. campestris pv. vesicatoria also possesses a cellulose synthase operon (XCV3640 to XCV3644) with a c-diGMP binding subunit (XCV3642). However, this catalytic subunit is probably not functional due to an internal stop codon (XCV3643 to XCV3644).
Although histidine kinases and methyl-accepting proteins (see below) are the main sensors of extracellular signals, they are by no means the only ones present in
X. campestris pv. vesicatoria. As known for most bacteria, PAS (31 proteins) and GAF (17 proteins) domains are the most common cytoplasmic signaling domains found in
X. campestris pv. vesicatoria (
99).
Finally, the genome of
X. campestris pv. vesicatoria encodes 15 RNA polymerase σ factors (
69), including 1 primary σ factor (RpoD), 1 heat-shock factor (RpoH), 1 flagellar-specific factor (FliA), 10 alternate extracytoplasmic function-type factors, and 2 σ54 family factors (RpoN). Interestingly, one of the alternate sigma factors (XCV1276) carries a C-terminal extension of approximately 200 amino acids which is unique to
X. campestris pv. vesicatoria,
X. campestris pv. campestris, and
X. oryzae pv. oryzae.
The high number and modular diversity of regulatory proteins relative to those of other bacteria, e.g.,
E. coli (
36), suggest that xanthomonads are able to cope with complex and changing environmental conditions. Future studies will reveal whether any of these proteins are components of the regulatory cascade which controls expression of virulence factors like the TTSS.
Quorum sensing, chemotaxis, and motility for the adaptation to environmental conditions.
Quorum sensing is a mechanism by which bacteria regulate the expression of certain genes in response to their population density (
103). Only a few xanthomonads appear to produce the typical autoinducer molecule
l-homoserine lactone, and there is no indication that
X. campestris pv. vesicatoria does so (
22). Two different alternative autoinducers have been well studied in
X. campestris pv. campestris. The diffusible signal factor (DSF), an α,β-unsaturated fatty acid (
105), is involved in the regulation of the synthesis of extracellular enzymes, exopolysaccharides, and cyclic glucans. Eight genes (
rpfA to
H) which are involved in DSF production and perception are also present in
X. campestris pv. vesicatoria, thus providing circumstantial evidence for the existence of a DSF regulatory system in
X. campestris pv. vesicatoria. A second diffusible factor (DF), a butyrolactone, is involved in the regulation of pigment (xanthomonadin) and exopolysaccharide synthesis in
X. campestris pv. campestris (
77). Next to the xanthomonadin biosynthesis gene cluster in
X. campestris pv. vesicatoria is an operon that might correspond to the
pigB locus of
X. campestris pv. campestris, which has been implicated in the synthesis of DF (
77). Both quorum-sensing systems participate in the regulation of exopolysaccharide, an important virulence factor in
X. campestris pv. campestris (
75,
78). Based on the conserved gene content, these systems are probably functional in
X. campestris pv. vesicatoria.
X. campestris pv. vesicatoria carries a single unipolar flagellum. We identified a 147-kb region in the chromosome (116 genes, XCV1929 to XCV2044) which is almost exclusively devoted to chemotaxis and flagellar biosynthesis. The most interesting feature of this large chromosomal region is the presence of 14 tandemly repeated genes encoding methyl-accepting chemotaxis proteins. Similar gene clusters were also found in X. axonopodis pv. citri (10 mcp genes), X. campestris pv. campestris (10 mcp genes), and X. oryzae pv. oryzae (7 mcp genes). Intriguingly, this region appears to be highly dynamic since its size and gene order differ among the four Xanthomonas strains analyzed. The following other chemotaxis-related genes are also present in several copies which are dispersed in the chromosome: 10 more mcp genes, 3 cheA homologs, 2 cheB homologs, 4 cheR homologs, and 6 cheW homologs. It will be interesting to elucidate which environmental stimuli are sensed by this complex chemotactic system.
Bacterial motility is not limited to swimming. Type 4 pili enable movement by retraction and mediate bacterial adhesion to plant tissue. Immunofluorescence studies showed that purified type 4 pili of
X. campestris pv. hyacinthi attached to stomata of hyacinth leaves, suggesting a role for these surface structures in the first stages of yellow disease (
100). In
X. campestris pv. vesicatoria strain 85-10, several operons are predicted to encode components of type 4 pili. Although type 4 pilins are very diverse in their primary structure, they can be identified by the presence of a prepilin peptidase-processing site followed by a transmembrane α-helix. Using this criterion, 15 type 4-related pilin genes were found, 8 of which are part of the two different type II protein secretion systems (see below). So far, the function of type 4 pili in pathogenicity of
X. campestris pv. vesicatoria is not clear (
74).
Surface structures involved in bacterial interactions.
Besides type 4 pili, several other surface structures might be implicated in the adherence of gram-negative bacterial pathogens.
X. campestris pv. vesicatoria encodes several proteins which could act as adhesins (XCV1860, XCV1861, XCV2103, XCV3670, XCV3672, XCV4203, and XCV4444), such as homologs of the
Yersinia proteins YapH and YadA (
43), HecA from
Erwinia chrysanthemi (
83), and the filamentous hemagglutinin from
Bordetella pertussis (
56). In
E. chrysanthemi, HecA is encoded next to the TTSS cluster and plays a role in virulence (
51). Homologs of
hecA are also present in the genomes of
Xylella spp. and
R. solanacearum (
83). The nonfimbrial adhesin YadA forms a sheath-like structure on the surface of
Yersinia and mediates adherence to epithelial cells and autoagglutination (
43). Homologs of YadA have been found in several plant and animal pathogenic bacteria (
73), e.g., in
X. oryzae pv. oryzae, where XadA was shown to be an important virulence factor (
79).
Gram-negative bacteria exhibit complex sets of polysaccharides on their surfaces, which often contribute to their pathogenic interactions with plant and animal cells (
29). The typical exopolysaccharide of the genus
Xanthomonas is xanthan, the structure and biosynthesis of which have been well studied in
X. campestris pv. campestris (
7). The
gum gene cluster which controls xanthan biosynthesis in
X. campestris pv. campestris is conserved in
X. campestris pv. vesicatoria (XCV2776 to XCV2789). Mutant studies in
X. campestris pv. campestris demonstrated that xanthan gum is not essential for pathogenicity but contributes to bacterial aggressiveness against the host (
49).
Lipopolysaccharides from
Xanthomonas play a role as elicitors of plant defense reactions (
61,
66). Lipopolysaccharide biosynthesis in
X. campestris pv. vesicatoria strain 85-10 appears to follow the ABC transporter-dependent pathway (encoded by
wzm and
wzt). Next to the genes for the ABC transporter and the lipopolysaccharide core biosynthesis (
rmd and
gmd), several genes for O-antigen synthesis (
wxc) are predicted. Interestingly, the
wxc genes have counterparts in
X. campestris pv. campestris but not in
X. axonopodis pv. citri or
X. oryzae pv. oryzae. However, all of the glycosyltransferase genes required for O-antigen modification in
X. campestris pv. campestris (
104) are missing in
X. campestris pv. vesicatoria. Instead, two deviant glycosyltransferase genes (
wbdA1 and
wbdA2) were found within this gene cluster, and they encode homologs of mannosyltransferases (see supplementary Table
3 at
https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html ). Thus, it seems that the O-antigen of
X. campestris pv. vesicatoria is basically a polymannan, in contrast to the xylosylated polyrhamnan of
X. campestris pv. campestris (
64).
Surprisingly, a novel gene cluster of 11 genes (XCV1723 to XCV1733), which encodes a putative polysaccharide polymerase and a chain length regulator, both characteristic for heteropolymer polysaccharide biosynthesis, was identified. In addition, this cluster harbors three predicted glycosyltransferases and enzymes modifying sugar nucleotides or transferring amino or methyl groups. Homologs of this gene cluster were also found in the genomes of X. axonopodis pv. citri, X. campestris pv. campestris, and X. oryzae pv. oryzae. It is not clear which carbohydrate is produced by this novel gene cluster.
Pathogenicity determinants of X. campestris pv. vesicatoria.
Bacterial secretion systems are important for the interaction of pathogens with the host (
18).
X. campestris pv. vesicatoria contains genes for all known protein transport systems in gram-negative bacteria, namely the Sec, signal recognition particle, and TAT pathways; at least one putative type I, two type II, one type III, and two putative type IV secretion systems of different types; four type V autotransporters; and two two-partner secretion systems (Table
2; supplementary Table
3). The presence of all known secretion systems was highlighted before for only the plant pathogens
E. carotovora subsp. atroseptica (
8) and
R. solanacearum (
37).
The Sec pathway is important for the export of many proteins into the periplasmic space (Table
2; see supplementary Table
3). The TAT system (
tatA-tatC) offers an alternative route to the periplasm for folded proteins (
81), but so far the function of this pathway has not been elucidated for
Xanthomonas. Predicted candidate substrates of the TAT pathway are 66 proteins (Table
2; see supplementary Table
4 at
https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html ), e.g., putative xylosidases and cellulases.
Type II secretion systems of pathogenic bacteria secrete a vast number of proteins, including possible virulence determinants, into plant intercellular space (
89).
X. campestris pv. vesicatoria, like
X. axonopodis pv. citri and
X. campestris pv. campestris (
26), harbors two type II secretion systems, whereas
X. oryzae pv. oryzae encodes only one (Fig.
2). Remarkably, the gene for a prepilin leader peptidase is lacking. Instead, this task might be fulfilled by the type 4 pilus assembly leader peptidase (XCV3355) (
45). In
X. campestris pv. vesicatoria strain 85-10, likely substrates include cellulases (nine candidates), β-glucosidases (five candidates), pectate lyases (four candidates), polygalacturonases (three candidates), and xylanases (five candidates) (Table
2; see supplementary Table
3). These proteins probably exhibit plant cell wall-degrading activity. The nature and number of these candidates are similar in all four xanthomonads. Additionally, we found a disrupted homolog of Pat-1 from the gram-positive tomato pathogen
Clavibacter michiganensis subsp. michiganensis (XCV4424 and XCV4425). This gene, which encodes a putative secreted serine protease that is essential for wilt induction on tomato (
33), is not present in
X. axonopodis pv. citri and
X. campestris pv. campestris. However, a nondisrupted homolog is present in
X. oryzae pv. oryzae (XOO0986), suggesting a possible role of this gene in virulence.
In addition to the well-studied secretion systems,
X. campestris pv. vesicatoria contains homologs of
raxST,
raxA,
raxB, and
raxC (XCV1244 to XCV1246 and XCV3591), which are involved in AvrXa21 activity in
X. oryzae pv. oryzae. The nature of AvrXa21 is unknown, but it leads to specific recognition of bacteria in rice plants expressing the
Xa21 resistance gene (
27). RaxA and RaxB show similarity to a double-glycine leader peptide-dependent secretion system and might, in concert with RaxC, secrete the so-far-unknown substrate (
27).
The main virulence determinant of
X. campestris pv. vesicatoria is the TTSS (
17,
19), which is encoded by the
hrp pathogenicity gene cluster (
11) located in a 35.3-kb chromosomal genomic island (459,555 bp to 494,868 bp). Interestingly, the regions flanking the 16.2-kb core cluster (
hrcC-hpaB) encode accessory components and substrates of the TTSS and are not conserved among the different xanthomonads (
19,
53,
72).
Most promoters in the
hrp gene cluster contain PIP boxes (TTCGC-N
15-TTCGC), the supposed binding motif for the transcriptional regulator HrpX (
35,
107). In addition to 8 known PIP boxes, we identified 15 novel boxes within a reasonable distance upstream of the CDS (Table
3). The genome contains 189 imperfect PIP boxes (TTCG-N
16-TTCG); however, no clear association with pathogenicity-related genes is evident. Future work will elucidate how many of these genes are regulated by HrpG and HrpX and might play a role in pathogenicity. However, it is known already that not all HrpX-regulated genes possess this motif in their promoter regions (
71). The genome sequence enables us to determine the complexities of HrpG- and HrpX-dependent regulation of gene expression by transcriptome analyses.
TTSS of plant pathogens translocate a large number of effector proteins into the plant cells (
1). So far, 14 type III effectors of
X. campestris pv. vesicatoria strain 85-10 are known (
19,
60,
70,
82). Analysis of the
X. campestris pv. vesicatoria genome sequence led to the identification of six new candidate effectors (XCVd0105, XCV0294, XCV1298, XCV2280, XCV3786, and XCV4428) (Table
4), most of which have homologs in other xanthomonads and different
P. syringae strains but not in
E. carotovora subsp. atroseptica (Table
4). Type III effectors of
P. syringae were designated according to the new unified nomenclature (
55). It appears that homologs of HopX2 (formerly HopPmaB), HopH1 (formerly HopPtoH), HopQ1-1 (formerly HopPtoQ), and XopP are distributed most widely among plant-pathogenic bacteria. For almost all effectors, homologous proteins can be found in other pathogenic bacteria. The only exception is XopC, which is unique for
X. campestris pv. vesicatoria (
70). It should be noted that in contrast to other
X. campestris pv. vesicatoria strains, strain 85-10 does not contain homologs of the well-studied AvrBs3 family of effectors (
17). Xanthomonads also lack effectors of the YopT family, which is found in many animal and plant-pathogenic bacteria, such as
P. syringae and
R. solanacearum. In summary, the set of effectors differs between several species and even between closely related bacteria like the four xanthomonads (Table
4). The specific effector set of a given strain is certainly important for its interaction with the plant, thus potentially determining host range.
Ten of the 20 candidate and proven effector genes of
X. campestris pv. vesicatoria show significantly lower G+C contents, indicative for acquisition by horizontal gene transfer (
31). This idea is corroborated by the fact that some of the predicted effector genes are located in the vicinity of mobile genetic elements. It is noteworthy that we identified two novel effector integrons similar to a previously described element (Fig.
3) (
70). All three integrons are flanked by inverted repeats that were first identified next to genes coding for effectors of the AvrBs3 family (
10,
70). One of the newly identified integrons is located on the chromosome and is associated with a gene (XCV2280) encoding a homolog of the effector HopX2 from
P. syringae (Table
4) (
40). The other integron is located on the plasmid pXCV183 and harbors the putative postsegregational killing system. This integron is flanked by a type III effector locus, which encodes AvrBs1 and a putative tyrosine phosphatase homologous to the effector HopAO1 (formerly HopPtoD2) from
P. syringae pv. tomato DC3000 (
12,
34). We were surprised that these are the only two effector genes present on the plasmids of
X. campestris pv. vesicatoria strain 85-10. Similar effector integrons are also present in the
X. axonopodis pv. citri chromosome and plasmid pXAC64 (
70).
In addition to protein secretion systems, other factors might be important for the pathogen-plant interaction. We identified 65 proteins with possible functions in defense of
X. campestris pv. vesicatoria against antimicrobial substances (Table
2; see supplementary Table
3). Interestingly, genes coding for a copper-efflux system of
X. campestris pv. vesicatoria (XCV3747 and XCV3748) were first described to be carried on the same plasmid as AvrBs1 (
92), but in strain 85-10, they are located on the chromosome. The known streptomycin resistance of this strain is encoded on a transposon (
93). Additionally, we identified genes coding for catalases (five candidates), peroxidases (five candidates), and superoxide dismutases (four candidates), which might function in detoxification of reactive oxygen species produced during the plant defense reaction (see supplementary Table
3).
Conclusions.
In this study, we present the complete genome sequence of X. campestris pv. vesicatoria, which revealed novel insights into the genome plasticity and pathogenicity of this important plant pathogen. All xanthomonads carry a high number of mobile elements; however, their set differs in each strain. Several regions of atypical G+C content are associated with both IS elements and tRNA genes and contain putative pathogenicity determinants which might have been acquired by horizontal gene transfer. The presence of four plasmids, two of which carry a tRNA gene not present on the chromosome, was unexpected. The role of the plasmids in pathogenicity is currently being addressed by plasmid curing.
Intriguingly, X. campestris pv. vesicatoria possesses all known types of protein secretion systems. Previous studies revealed that the TTSS is essential for pathogenicity. Here, several novel type III effectors were predicted and these will be analyzed in the future. In addition, the role of the other secretion systems in the interaction with the plant will be examined with a focus on cell wall-degrading enzymes and adhesins. One of the highlights of our genome analysis is the finding of a candidate Icm/Dot-like type IV secretion system in X. campestris pv. vesicatoria.
We were surprised by the number of genes devoted to regulation, a process that is poorly understood in X. campestris pv. vesicatoria. The identification of two quorum-sensing systems and of a complex and apparently dynamic chemotactic system will stimulate the study of regulation in the context of pathogenicity.
Finally, the first identification of genes possibly involved in adhesion will open new directions in the study of the complete arsenal of virulence factors in X. campestris pv. vesicatoria.