Genetic diversity of MGEs in clade 4 (transposons, prophages, CRISPRs, and plasmids).
(i) Transposons and conjugative transposons. Several transposons (Tns) and conjugative transposons (CTns), including CTn
5, Tn
4453a/b, Tn
5397, Tn
5398, Tn
6194, Tn
916, and novel putative transposons containing regions homologous with CTn
2 and CTn
7 (
Table 1), were identified by comparison with reference genomes. A novel putative transposon element was discovered in isolate ZR9. This element was located in scaffold 35 from 2,689 to 3,4943 bp and contained 36 coding sequences (CDS), of which 10 were unique in this element, while the remaining 26 genes demonstrated 83.33% to 100% identity with CTn
2 (
Fig. 2A). Among the 10 unique genes, G002737, G002739, G002743, G002760, and G002761 were annotated as
C. difficile putative conjugative transposon protein Tn
1549-like, CTn
2-ORF30, CTn
2-ORF32, CTn
5 ORF2, 18, and 19, respectively. G002741 was annotated as putative RNA methyltransferase of
C. difficile E7. G002744, G002747, G002748, and G002759 were defined as hypothetical protein, bacterial regulatory helix (AraC family protein), transcriptional regulator (effector binding domain protein), and transcriptional regulator (AbrB family domain protein), respectively. In isolate ZR18, another putative novel mobile element was found, including 24 coding sequences, of which 10 genes were absent in CTn
7, while the other genes were similar to those in CTn
7 (
Fig. 3A). Large fragment insertions and deletions located in 630_03693 to 630_03688, and 630_03686 to
mgtA2 were identified in isolate ZR18 (
Fig. 3A). The 10 novel genes (G001824, G001827, G001828, G001967, G001969, G001972, G001974, G001975, G001976, and G001977) in ZR18 were annotated as hypothetical protein, CTn
7-ORF 4, 6, and conjugal transfer protein of
C. difficile, and also proteins of other gut pathogens such as
Firmicutes and
Enterococcus faecium. In addition, a CTn
7-like element was identified in reference M68, showing >85% identity with 630 CTn
7 (
Fig. 3A). Two genes (M68GM003529 and M68GM003530) were inserted between 630_03695 and 630_03694. In M68, M68GM003535 replaced 630_03690 at the same site, while 630_03685 was missing (
Fig. 3A). It is noteworthy that nearly half of genes in CTn
7 were lost in M68, with the addition of another 11 coding genes (
Fig. 3A), encoding transcriptional regulator, ABC transporter, antimicrobial resistance gene
ermB (M68GM003552), and
bcrA (M68GM003544), putative truncated zeta protein, ATPase-associated proteins, and other genes originating from
C. difficile and other enteric bacteria (
Erysipelotrichaceae,
E. faecalis,
E. faecium,
Campylobacter jejuni,
Firmicutes, and
Staphylococcus aureus).
CTn
5-like transposons, belonging to the Tn
1549 family, were detected widely in the tested
C. difficile isolates of clade 4, with the exception of isolate 35 and 11032 (
Fig. 2B). Ten types were divided according to gene composition as follows (
Fig. 2B): type 1 (HN9), type 2 (BJ08), type 3 (15), type 4 (GZ8), type 5 (ZR18), type 6 (GZ3), type 7 (29), type 8 (2, 28, 5, 7, ZR8, ZR9), type 9 (23, 38, 4, 50, 6, GZ11, GZ12, GZ2, GZ6, ZR29, ZR58, ZR59, Z566, 10122), and type 10 (16, GZ13, GZ14, ZR65, ZR68, ZR72, ZR73, ZR82). Two common major deletions in the conjugative region were observed in these 10 types, one being a deletion of 630_02052, and the other one located between 630_02066 and 630_02072 (
Fig. 2B). Especially for types 3, 5, and 8, other large fragment deletions were also discovered in the conjugative region (
Fig. 2B).
Tn
4453a/b was found with typical gene organization in isolates ZR18, 2, 7, and 28 (
Fig. 3B). The composition and location of genes in ZR18 was identical to that in reference Tn
4453a/b, with the exception of a −114-bp deletion in
tnpv (
Fig. 3B). The key feature of this element was the presence of
catD, which mediates resistance to chloramphenicol. Interestingly,
catD was absent in isolates 2, 7, and 28 and replaced by five new genes: aminoglycoside acetyltransferase (
aac), three transferases (phosphotransferase, nucleotidyltransferase, and acetyltransferase) and an ATPase (
Fig. 3B). The
aac gene, which confers resistance to aminoglycoside antibiotics, such as gentamicin, tobramycin, and netilmicin has been widely reported in
Enterococcus and
Enterobacteria, and can be transferred between Gram-positive and Gram-negative bacteria by transposons (Tn
5281, Tn
4001, and IS256) or plasmids (
31). BLAST analysis revealed that the DNA sequences of
aac in isolates 2, 7, and 28 showed 100% coverage and identity with
aac(6′
) aph(2′′
) in
C. jejuni. The total length of the gene was 1,455 bp, encoding a bifunctional aminoglycoside
N-acetyltransferase and aminoglycoside phosphotransferase, which is one of the aminoglycoside-modified enzymes (AMEs), playing critical role in high-level gentamicin-resistant (HLGR) of
Enterococcus (
32).
Tn
5397, also known as CTn
3 in CD630, was recognized in most isolates in this study and was classified into four types as follows: type 1, ZR18 and M68; type 2, ZR73 and 5; type 3, BJ08, GZ8, and HN9; type 4, ZR82, ZR9, 2, 23, 28, 29, 38, 4, 6, 7, GZ11, GZ12, GZ13, GZ14, GZ2, GZ3, GZ6, ZR29, ZR58, ZR59, ZR66, ZR72, and ZR8 (
Table 1). The
tndX gene, encoding a large serine recombinase, which is essential for insertion/excision of Tn
5397, was absent in all types (
Fig. 3C). Furthermore, a group II intron within orf14 was also deleted (
Fig. 3C). Significantly, in type 2, Tn
5397 was missing some of the downstream genes, including an antimicrobial resistance gene
tetM, compared with the reference Tn
5397 (
Fig. 3C). Type 1 displayed the most similar gene composition and structure compared with the reference Tn
5397, although two new genes were inserted (
Fig. 3C).
Tn
5398, first reported in strain 630, contains
erm 1B and
erm 2B, and can be transferred between
C. difficile isolates, or between
C. difficile and
S. aureus or
Bacillus subtilis (
33). Two types of Tn
5398-like elements were identified here, in which
erm 1B and
erm 2B genes were absent (
Fig. 3D). Type 1 consisted of 35, ZR73, ZR82, 16, 23, 28, 29, 4, 5, 50, 6, 7, BJ08, GZ13, GZ14, GZ2, GZ6, HN9, ZR29, ZR65, ZR66, ZR68, ZR72, and ZR8, while type 2 was represented only by isolate 15 (
Table 1).
Tn
916 is a major family transposon reported in
C. difficile and carries the
tetM resistance gene. The Tn
916 element of these tested isolates were subdivided into multiple types as follows: type 1, GZ8; type 2, ZR18; type 3, ZR73; type 4, 23, 29, 4, 6, GZ11, GZ12, GZ13, GZ14, GZ2, GZ3, GZ6, ZR72, ZR8, ZR82, and ZR9; type 5, BJ08 and HN9; type 6, 5; type 7, 2, 28, 38, 7, ZR29, ZR58, ZR59, and ZR66 (
Table 1). The
tetM resistance and
int gene (responsible for Tn insertion), which are responsible for insertion, appeared in all types (
Fig. 2C). With the exception of type 6 with deletions in three large regions, deletions occurred in a single area of the regulation region in all the other types (
Fig. 2C). Intact
xis and
int genes were retained in types 3, 4, 6, and 7. The conjugative region was highly homologous within most types, except for types 3, 6, and 7, which had large fragment deletions (
Fig. 2C). Isolate ZR18 carried no typical
xis and
int genes of Tn
916, while BJ08, GZ8, and HN9 contain
int without
xis.
The representative gene cluster of Tn
6194 carrying
ermB of strain CII7 (GenBank accession no.
HG475346) was used as a reference in this study. Tn
6194 has a conjugation region that is closely related to that of Tn
916 but contains an accessory region that is related to Tn
5398. M68 demonstrated high-level homology with only the absence of cds22 and cds 23, and indels within cds5 and cds 11, while intact
xis and
int genes were retained for excision and insertion (
Fig. 2D). The following six types were defined here: type 1, 16, 2, 28, 38, 5, 7, GZ13, and GZ14; type 2, 23; type 3, ZR66; type 4, ZR29, ZR58, and ZR59; type 5, 29, 4, 6, GZ11, GZ12, GZ2, GZ6, ZR8, and ZR9; type 6, GZ3 (
Table 1).The
ermB gene was present in all types except type 2 (
Fig. 2D). The
xis gene, which is involved in the excision of Tn
916 from the donated strain were deleted in all types (
Fig. 2D).The
int gene, which plays a key role in the integration of Tn
916 into the recipient isolate, appeared in types 2, 5, and 6 only (
Fig. 2D).
(ii) Prophages, plasmids, and CRISPR. Prophages were identified in various amounts in all tested isolates (
Table S2). In total, 497 prophages were predicted in this study. Isolates 2 and ZR66 contained the highest number of prophages (19 prophages), while isolate 15 had the fewest (3 prophages) (
Table S2 and
Fig. 4A). A total of 115 intact prophages was confirmed among in all the isolates, except ZR18, 15, and 11032 (
Fig. 4A). Among the intact prophages, the number of ΦCDHM19, ΦCD506, ΦMMP01, ΦMMP03, ΦMMP02, ΦMMP04, and ΦC2 was more than three per isolate (
Fig. 4B). ΦCDHM19 and ΦCD506 represented more than half (54.78%) of all intact prophages (
Fig. 4B). All of these prophages belonged to the
Myoviridae family, and some were induced during
C. difficile infection (
34). Most prophages identified here were homologous to known phages reported in
C. difficile, although a few were similar to other bacterial phages, such as
C. jejuni,
S. aureus,
Enterobacteria,
Bacillus, and other unusual bacteria, such as
Prochlorococcus,
Gordonia, and
Sphingomonas (
Table S2). Isolates 50 and 35 carried the highest number of intact prophages, 8 and 7, respectively (
Table S2 and
Fig. 4A). The P-SSM2-like phage of
Prochlorococcus found in isolate 50 showed the highest number of proteins similar to those in the region (
Table S2), while the ΦC2-like phage in isolate 35 displayed a high number of “hit_genes_count” in bracket (8 out of 9) with ΦC2 (
Table S2). The prophages found within the ST 37 and ST 81 isolates were diverse, although ΦCD506 was predominant. No obvious correlation was found between prophage type and STs. Future studies will focus on the role of these elements in the
C. difficile evolution and infection.
A total of 464 CRISPR arrays, composed of different copies of direct repeat (DR) sequences, and separated by unique spacers were identified in the tested isolates from clade 4 (
Table S3). The number in each isolate varied from 5 to 19, and the length ranged from 60 to 1,608 bp (
Table S3). In total, 90 of the CRISPR arrays displayed homology with prophages (
Table S3). Isolates 10122, 11032, 29, and GZ8 harbored prophages without homologous CRISPR sequences (
Table S3).
Sixteen potential plasmid sequences with almost 100% coverage were identified among the 37 isolates (
Table S4) and were found to contain important antimicrobial resistance genes (
aadE,
aad9,
ermB,
tetM,
tetS, and
tetO) (
Table S4).
Antimicrobial resistance gene and related antimicrobial susceptibility.
In this study, 60 antibiotic resistance genes were identified by comparison with the ARDB and CARD databases (
35,
36). These antibiotic resistance genes belonged to the following antimicrobial classes: fluoroquinolones, β-lactam antibiotics, macrolides, aminocoumarin, rifampin, glycopeptides, tetracycline, macrolide-lincosamide-streptogramin (MLS), chloramphenicol, trimethoprim, streptomycin, aminoglycoside, bacitracin, and lipopeptide (
Fig. 6). Most isolates carried approximately 13 to 16 copies of the
macB gene, except for ZR18 carrying 9 copies. The next highest numbers of copies were for the
vanRI and
bcrA genes (
Fig. 6). Remarkably, some isolates carried a unique antibiotic resistance gene cassette, for example, ZR18 and CF5 contained a unique antibiotic resistance gene cluster of approximately 6 kb comprising
vanXYG,
vanSG,
vanTG, and
vanG, which were integrated as glycopeptide resistance gene (
Fig. 6). Additionally, ZR18 also carried
catD (carried by Tn
4453a/b), and
cata11, which are related to chloramphenicol resistance, and
tetA(48) (
Fig. 6). No functional vancomycin resistance operon was identified here. Furthermore, isolate 11032 contained unique
tetA(P),
tetT,
tetpb, and
tetpa genes (
Fig. 6). In addition, ZR9 carried a unique
novA gene, and M68 had unique
cata1 and
catl genes (
Fig. 6).
According to the Etest results, all isolates except for 10122 and 35 were MDR strains, maintaining a resistance rate as high as 97.30%. Details are summarized in
Table 2. Almost all MDR strains displayed high levels of drug resistance (
Table 2) with quinolone exhibiting 100% resistance to CIF, followed by LVX (97.30%) and MXF (56.76%) (drug abbreviations in “Antimicrobial susceptibility tests” in Materials and Methods) (
Table 2). The quinolone resistance is due to a point mutation and substitution within
gyrA and/or
gyrB. Clade 4
C. difficile retained high-level resistance (91.89%) to ERY compared with the other isolates of other clades reported in
Table 2. The
ermB gene, encoding an antibiotic target-modifying enzyme, is related to MLS resistance. The
ermB gene was detected in 25 of 34 ERY-resistant isolates (73.53%). ERY resistance is attributed to methylation of position 2058 in 23S rRNA; however, the exact mutation in ERY-resistant
C. difficile isolates remains to be identified in the future. In addition, high copy numbers of the macrolide resistance
macB gene, encoding a macrolide ABC transporter protein, were identified in all the tested isolates (
Fig. 6). The rate of resistance to CLI was 89.19%, with a high level of resistance detected among the isolates. The rate of resistance to CHL was lower (29.73%) and related to
cata11 and
cata1, which were identified only in isolates ZR18 and M68, respectively, indicating the existence of other mechanisms of CHL resistance. It is worth mentioning that
catD, normally carried by
Tn4453a/b and also responsible for CHL resistance (
Fig. 3), was present in isolate ZR18, which showed an intermediate level of CHL resistance (MIC = 64 μg/ml). However,
catD was replaced by another five genes in isolates 7, 2, and 28, which were CHL susceptible. The five new replacement genes included
aac(6′
) aph(2′
), which is correlated significantly with HLGR and is responsible for aminoglycoside resistance, leading to gentamicin and amikacin resistance in isolates 7, 2, and 28. The rates of MEM (51.35%) and RIF (48.65%) resistance were similar among the clade 4
C. difficile isolates. All isolates resistant to CHL and RIF were ST37 (
Table 2). The
rpoB and
rphB genes with several mutations related to RIF resistance were detected in almost all isolates. The mechanism underlying this resistance involves substitution of several nucleotides in the
rpoB gene. None of the clade 4 isolates were resistant to VAN or MET, although VAN-related genes or gene cassettes were identified.