RESULTS AND DISCUSSION
A total of 87
E. coli ST648 genomes were analyzed (
Fig. 1). The pangenome matrix revealed 12,276 coding sequences, with 3,596 genes shared by all 87 genomes that represent the maximum common genome (MCG). Mapping all sequencing reads against the completely closed chromosomal genome of IMT16316, which was also constructed in this study, resulted in an alignment of 3,883 Mbp. This alignment contained 3,834 informative single-nucleotide polymorphism (SNP) sites, which were used to estimate a maximum-likelihood phylogeny (
Fig. 1A). The resulting tree revealed a ST648 lineage with low genetic diversity, divided into only four clades. Clades 1 and 2 differed by approximately 200 SNPs, clades 1 and 4 differed by approximately by 550 SNPs, and clade 3 was the most distant (an ∼600-SNP difference compared to clades 1 and 2). These results were confirmed by a Bayesian clustering using a Bayesian analysis of the population structure (BAPS) (
Fig. 1) and are in contrast to lineage ST131, for which studies have identified approximately 7,000 to 9,000 SNPs between the three main clades (A, B, and C, with the latter comprising subclades C1 and C2, corresponding to H30R and H30Rx) (
1,
10). It should be noted that a selection bias in the strain set may have influenced our results; however, we estimate the odds to be rather low.
Notably, ST648 clade 2 strains almost ubiquitously carried an identical ESBL plasmid, as discussed below. Strains from humans and animals with different backgrounds were distributed throughout the phylogeny (
Fig. 1A), suggesting ST648’s interspecies movement and zoonotic potential while lacking clear genomic signatures that would indicate ecological adaptation to individual host species and infection sites. The geographical distributions were also similar, where strains from different continents occurred throughout the phylogeny (
Fig. 1A). These results suggest that ST648 is a host generalist rather than a specialist, with the capability of frequent cross-species transmission, thriving in different clinical and nonclinical contexts, similar to ST131 (8). By including strains from multiple geographic regions and wildlife (
Fig. 1A), unique insights into this newly emerging clonal lineage were discovered beyond the usually narrower human, clinically centered focus.
We analyzed the accessory gene profiles of the 87 ST648 genomes based on a total size of a variable pangenome of 8,680 genes, which provides information about past evolutionary events in this population, in addition to the simpler focus on plasmid-related gene contents (
Fig. 1A). The accessory genome did not cluster with any assigned metadata groups (i.e., country, host, and origin) and only to a limited extent with the core genome-based clades, where clade 2 showed a slightly different accessory gene profile (
Fig. 1A). These results are in contrast to those observed with ST131, where McNally et al. reported several subtypes within ST131 clade C based on highly congruent accessory gene profiles (
8).
By estimating the substitution rates and creating a time-scaled tree with estimated dates of the internal nodes, we were able to date important events within the microevolution of this lineage (
Fig. 1B). The first split of the lineage dates back to 2001, and the more distantly related clades 3 and 4 separated from the branch-containing clades 1 and 2 in 2004. The most recent event in forming the ST648 population structure was the separation between clusters 1 and 2 in 2006. Along with the observation that the accessory genome did not have sufficient time to diversify along the shallow core lineages, these results suggest a young, nascent ST648 lineage. For ST131, Stoesser et al. estimated the time to the most recent common ancestor to be significantly longer, about 130 years ago, when clade A diverged from clades B and C, followed by a split of clades B and C 25 years ago (
1).
To investigate whether ST648 and ST131 strains were similarly virulent while being more virulent than ancestral ST10, we performed a genomic analysis of all ST648, ST131, and ST10 genomes for virulence-associated genes (VAGs) typically associated with pathogenic
E. coli. Colonization, iron uptake, and biofilm formation are key enabling factors for the clinical success of ST131 (
11). Although biofilm formation contributes to bacterial pathogenesis through features such as immune evasion and adhesion, bacterial iron acquisition is especially important during infection as host-associated proteins sequester and thus deplete metals at the infection site (
12). Based on a manually curated database with 75 VAGs highly relevant in pathogenic
E. coli (
13–15), the results revealed the highest overall mean for ST131 (36.2) per strain, closely followed by ST648 (31.1,
P = 0.681). In contrast, ST10 had significantly fewer VAGs (19.6,
P = 0.004) (see Fig. S1A in the supplemental material).
While biofilm/adherence- and iron uptake-associated genes were differently distributed, we observed similarities in factors related to invasion, toxins, and capsule-associated virulence. ST131 strains carried 21.7 biofilm/adherence-associated genes on average, and ST648 strains harbored an equal number (mean, 20.4;
P = 0.662), while the number was again significantly lower in ST10 strains (mean, 13.0;
P = 0.046). Similar ordering was observed in iron uptake-related genes (ST131 [mean, 8.6], ST648 [mean, 5.6;
P = 0.159], and ST10 [mean, 2.2;
P = 0.007]), which aligns well with earlier findings showing that ST131 is an important cause of urinary tract infections (UTIs) and follows in the context that strains belonging to phylogroup B2 often carry virulence factors, particularly those associated with extraintestinal infection, in contrast to phylogroup A. Iron acquisition is a particularly crucial step for urine pathogen survival in this largely metal-depleted ecological niche (
16).
To further elucidate the importance of observed differences, we investigated whether ST648, ST131, and ST10 carried clusters of particular biofilm/adherence- and iron uptake-related gene sets indicative of host niche adaptation (Fig. S2). This was true for adherence-associated genes (the
pap operon,
afaA,
iha,
nfaE, and
focX) in ST131. We found two large clusters (nodes 4 and 5) pointing toward their sophisticated adaptation to ecological niches where these VAGs are necessary, such as colonizing the urinary tract (Fig. S2B) (
17,
18). Actually, most node 4 and 5 ST131 strains were isolated from UTIs (87.5%). ST648 patterns were not as clearly associated with UTI-relevant adhesion factors (i.e., the lack of
pap genes in nodes 1 and 2), providing no evidence for UTI-related niche adaption in ST648 strains. This observation coincides with the finding of Vangchhia et al. that phylogroup F strains seemingly tend to lack virulence factors typically associated with extraintestinal infections (
19).
In contrast to the separation of adhesion-related gene clusters according to their ST (Fig. S2B), we observed mixed clusters for iron uptake-related genes with the same set of genes present in strains from different STs (Fig. S2A). The largest iron uptake cluster (node 1) consisted of both ST131 and ST648 strains (Fig. S2A, node 1: chuA, fyuA, irp2, iucD, iutA, sitA, sitB, sitC, and sitD) but no ST10 strains. Iron uptake is also important during infection outside the urinary tract. ChuA, for instance, enables bacteria to use hemoglobin as an iron source. The presence of a set of virulence genes such as chuA in both ST131 and ST648 underlines their virulence potential in different clinical contexts. Overall, the VAG analysis of ST648, ST131, and ST10 provides evidence that the differences in the overall numbers of VAGs are caused by certain clusters of adhesion and iron uptake-related genes indicative of pathogen niche adaption, especially in ST131 strains.
To functionally examine our genomic results on a phenotypic level, we performed siderophore secretion/iron uptake experiments for 60 randomly selected ST648, ST131, and ST10 strains. ST131 was superior (85%) to ST648 (25%) and ST10 (20%) in iron acquisition, which partly accords with the above-mentioned in silico results, while following in the context of its history as a major cause of UTIs (Fig. S1). However, since a basic set of iron uptake genes was shared by ST648 and ST131 (Fig. S2A, node 1: chuA, fyuA, irp2, iucD, and sitA, sitB, sitC, and sitD), the superiority of ST131 strains in iron uptake is a noteworthy finding.
As biofilm formation was also suggested by the
in silico results to be of particular interest, we performed phenotypic macrocolony experiments for all available ST648, ST131, and ST10 strains to gain a general overview of curli fibers and cellulose production, which are major biofilm components (
20) (
Fig. 1A and
Fig. 2). Curli fibers are considered virulence factors, since they contribute to the adherence of
E. coli to epithelial cells (
21). Bacteria mostly produce cellulose for protective reasons, and cellulose has been described in a survival context on abiotic surfaces (
22), demonstrating their potential to persist in both host and environment when expressing the relevant genes.
We observed significant differences in biofilm formation between the different STs (
Fig. 2B). The majority of ST648 (76%) produced curli fibers or cellulose, in contrast to ST10 (39%,
P < 0.001) and, interestingly, also ST131 (66%,
P = 0.027) (
Fig. 2B). As for both curli and cellulose production, ST648 again outcompeted (48%) ST131 (19.6% [
P < 0.001]) and ST10 (18.75% [
P < 0.001]). Although lacking typical UTI-associated adherence genes such as the
pap gene group, ST648 frequently produced major biofilm components. Furthermore, we observed a negative correlation between the ability to produce these components and the bacteria’s motile behavior, which confirms previous observations (
9). Most nonmotile strains showed curli and/or cellulose production, whereas the lack of such production was correlated with motility (
Fig. 1A). Particularly striking was clade 2, which most pronouncedly consisted of strains producing curli and/or cellulose while demonstrating reduced motility (
Fig. 1A).
Based on a comparative genomics analysis using a PacBio-sequenced reference (IMT16316), most of these strains carried an identical IncF-type ESBL plasmid, except for IMT24495 and IMT33149 (
Fig. 2A). These two strains did not produce biofilm-related factors. Since we have previously demonstrated that ESBL plasmids seemingly influenced biofilm formation in ST648 and ST131
E. coli (
8), we constructed ESBL plasmid-cured variants (PCVs), complemented with the above-mentioned clade 2 plasmid (pEcIMT16316 and IncF[F1:A1:B49]), and repeated the biofilm and motility experiments. With this complemented ESBL plasmid, both strains produced curli fibers and cellulose, while losing their motile behavior. Other closely related ESBL plasmids of the same IncF subtype carried by strains outside cluster 2 showed similar phenotypes, supporting our previous hypothesis that ESBL plasmids confer advantageous traits in certain
E. coli strains. Although the exact contribution of VAGs to colonization and fitness remains to be investigated in-depth for most
E. coli lineages, they have been a part of enabling the success of ST131, and it appears plausible that the same applies to ST648.
Several publications have reported the frequent occurrence of ESBL-producing ST131 and ST648 strains in wild-bird populations and poultry, possibly pointing toward an increased colonization ability of ESBL
E. coli in the avian gut (
23–25). To test whether ST131 and ST648 would differ in this respect
in vivo, we performed colonization experiments for ST648 and ST131 in chicken intestines. Four randomly selected ST648 strains of clade 2 (IMT16316, IMT17887, IMT21183, and IMT23463) were compared to four ST131 strains (IMT17433, IMT19205, IMT19224, and IMT27685). Thirty days after oral bacterial inoculation (10
4 CFU per animal), the cecum and jejunum contents of eight animals per strain were characterized and quantified. We did not detect significant differences (cecum,
P = 0.429; jejunum,
P = 0.219) in the overall numbers of ST648 compared to ST131 strains, which ranged from 10
6 to 10
8 CFU/g (cecum) and 10
2 to 10
5 CFU/g (jejunum) (
Fig. 3). This underlines the similarly high potentials of both lineages to colonize birds
in vivo.
Following colonization, host invasion presents the next crucial step for successful pathogenesis. It has been recently suggested that the abundance of certain
E. coli lineages in invasive infections such as bacteremia is driven by negative frequency-dependent selection (NFDS). So far, this has resulted in a relatively steady population structure of different phylogenetic lineages only disrupted by the emergence of ST131, as demonstrated in a large-scale bacteremia study performed in the United Kingdom. To also predict the invasive success of ST648, we applied the NFDS model to the above-mentioned UK BSAC bacteremia sample set (
26). Our results show that ST648 will expectedly increase to a stable frequency, with a maximum near 1.5% among all populations (Fig. S3A). Comparative simulations with a neutral model do not predict the reproductive success of ST648 (Fig. S3B), while the alternative NFDS model, which lacks migration, results in most simulations losing the low-frequency strain. Notably, however, if ST648 survives long enough, it will reach the same prevalence as in other NFDS simulations (Fig. S3C). This indicates that the predicted future prevalence is maintained by the NFDS model, in contrast to having arisen as an artifact of migration. ST648 is predicted to increase 10-fold in frequency as a cause of bacteremia over the next few years but to remain at much lower levels than ST131.
Since the combination of virulence with multidrug resistance (MDR) has been considered crucial to the global emergence of ST131 in clinical settings (
2,
27,
28), we sought finally to investigate antimicrobial resistance properties of our strains. A total of 94% of ST648 strains were MDR, followed by ST131 (72%) and ST10 (50%) resulting from an
in silico resistance screening (see Table S2 in the supplemental material). MDR was significantly increased in the ST648 group compared to ST10 (
P < 0.001) and also ST131 (
P < 0.001). Genes imparting resistance to non-beta-lactams included aminoglycosides [
aac(3)-II a/d, aadA5,
aph(3′)-Ia, aac(6′)Ib-cr, and
strA/B], sulfonamides (
sul1,
sul2), trimethoprim (
dfrA14,
dfrA17), tetracycline (
tetA and
tetB), and fenicoles (
catB3). The ESBL-type
blaCTX-M-15 was the dominant gene in ST648 and ST131 strains. The frequent presence of MDR in ST648 strains makes them likely clinically competitive compared to ST131.
ESBL-producing E. coli are leading MDR pathogens, headed by a few internationally relevant, high-risk clonal lineages with ST131 as the most prominent. ST648 presents an emerging lineage increasingly reported from multiple origins with the greatest potential to follow ST131’s success. We applied comparative genomics and functional experiments to a large collection of ESBL-producing E. coli strains to unravel ST648’s population structure and to elucidate the mechanisms that contributed to its emergence across different ecologies. Our work highlights its nascent, generalist character, while combining MDR and virulence on a level similar to ST131. Both have seemingly developed individually sophisticated features possibly conveying advantages in different ecologies: ST648 strains significantly more often produced biofilm-related curli fibers and cellulose and had similar numbers of biofilm/adherence virulence-associated genes, while ST131 strains efficiently acquired iron. The supposed “superiority” of ST131 in some virulence characteristics (e.g., the propensity to cause UTIs) might be outbalanced by ST648’s biofilm-forming properties and increased antimicrobial resistance.
Nevertheless, results stemming from
in silico and
in vitro experiments alone cannot serve to differentiate and fully explain the success of distinct
E. coli lineages. The ability of ST131 and ST648 to effectively colonize
in vivo, which has been previously reported for ST131 (
3) and was confirmed in the present study, is another major contributor to virulence. In addition, ESBL plasmids conferring beneficial traits for colonization and environmental survival in different ecologies likely aided in shaping and maintaining ST648 strains to being similarly successful. However, in addition to possessing an advantageous geno- and phenotypic makeup, emerging novel lineages have to be common enough to be able to spread in clinical and extraclinical environments. Our ST648 NFDS bacteremia modeling clearly underlines the fact that ST648 will increase in frequency. Although the prediction points toward lower levels than ST131, ST648 is well on its way to becoming another internationally circulating, high-risk clonal lineage, worsening infection treatment possibilities not only due to antimicrobial resistance but also due to other features.
These findings provide strong incentive for further in-depth studies of ST648’s properties and careful surveillance of its emergence across different ecologies.