INTRODUCTION
The antibiotic resistance crisis is growing steadily across almost every host and environment tested worldwide (
1). These trends are especially concerning in common and emerging pathogens, including extraintestinal pathogenic
Escherichia coli (ExPEC) (
2). ExPEC cause a wide range of opportunistic infections that have proved to be a challenge to define and diagnose with accuracy, and increasingly are resistant to multiple antimicrobial agents (
3,
4). The number of
E. coli sequence types (ST) identified to have caused ExPEC infections in humans and animals continues to rise (
5,
6). Whole-genome sequencing programs are shedding light on the phylogeny of emerging ExPEC, improving efforts to define their virulence and antibiotic resistance gene (ARG) pool, but the emphasis to date has overwhelmingly been on pathogens of human clinical relevance (
7–10).
E. coli ST457 belongs to Clermont phylogroup F (
11) and was first reported in 2008 (sampled between 2004 and 2005) in the United Kingdom from a patient with a urinary tract infection (UTI) (
12). ST457 has since been reported in studies from all continents and from diverse sources (
13–18), including wild animals in Antarctica (
19). These observations indicate that ST457 is a broad host range and globally disseminated
E. coli lineage.
ST457 often carries resistance to clinically important antibiotics. Carbapenemase-producing clinical strains of
E. coli ST457 from patients with bloodstream infection in Italy and Mexico (KPC-2), in Shanghai (NDM-5), and in the United States (KPC-3) have been reported (
17,
20–22). ST457 is frequently associated with the production of CTX-M (mainly from the CTX-M-1 group: CTX-M-15, CTX-M-27, and CTX-M-55) and CMY-2 β-lactamases in diverse countries, including Australia, Brazil, Mexico, United States, Canada, Italy, China, and Thailand (
13,
16,
17,
23–27).
E. coli ST457 is also often reported to carry plasmid-associated
mcr genes that encode resistance to colistin, an antibiotic of last resort. Reports of the isolation of ST457 carrying
mcr-1 from patients residing in the United States, Mexico, China, and Vietnam (
16,
24,
25,
28) are concerning. Reports of
mcr-1-carrying ST457 causing mastitis in dairy cattle in Japan (
14) and
mcr-1-positive ST457 strains in wildlife and in poultry, predominantly in Asia (
26,
29), shed light on the scale by which ST457 has spread. Notably, ST457 carrying
mcr-3 has been isolated from a wild bird in China (
30).
In poultry production systems in Paraguay, we described a multidrug-resistant (MDR) ST457 clone carrying
mcr-5,
blaCTX-M-8,
blaTEM-1A,
aph(
6)-
Id and
aph-Ib, and
sul2 (
31).
mcr-5 was typically located on an I1/ST113 plasmid that coharbored
blaCTX-M-8. I1/ST113 plasmids have been previously described to spread
blaCTX-M-8 in Japan (
32), Brazil (
33), and Germany (
34). I2 plasmids recovered from
E. coli ST457 were found to separately carry
mcr-3 in China and
mcr-1 in a clinical strain in Mexico and poultry strains from China, respectively (
24,
26,
30).
blaCMY-2 has also been detected on I1 plasmids carried by
E. coli ST457 in poultry, wildlife, and companion animals in Brazil (
15,
23,
35), underscoring the important role played by I1 plasmids in capturing and spreading genes encoding resistance to clinically important antibiotics in ST457, as previously shown in other
E. coli lineages (
15). The I1 plasmids belonged to various STs, including ST12, which is a notable plasmid lineage spreading
blaCMY-2 globally (
36).
blaCTX-M-55 was carried by F plasmids, specifically F18:A-:B1, in a clinical strain of ST457 from the United States and F33:A-:B- in poultry from Brazil (
16,
23).
MDR
E. coli are a feature of the fecal contents of silver gulls (
Chroicocephalus novaehollandiae) nesting on several islands off the coast of New South Wales, Australia. In an earlier study in 2012, Dolejska et al. (
37) described
E. coli belonging to diverse sequence types that carried carbapenemase genes, including
blaIMP-4,
blaIMP-38, and
blaIMP-26, and extended-spectrum-β-lactamase genes such as
blaCMY-2 from the cloacal contents of 504 gull chicks inhabiting three islands off the coast of New South Wales, Australia. Subsequent analyses, described here, showed that
E. coli ST457 were a feature of isolates recovered during that study. Specifically, we sought to determine if resistance to clinically important antibiotics is acquired and mobilized among the
E. coli ST457 sourced from these silver gulls by conducting whole-genome sequencing. We used a combination of short- and long-read DNA sequencing and hybrid assemblies to generate complete sequences of the chromosome and plasmid cargo of an ST457 strain representative of the panel of ST457 strains from Australian gulls. We also recovered
E. coli ST457 sequences from EnteroBase (
https://enterobase.warwick.ac.uk/) from different hosts and countries and performed detailed phylogenetic analyses to examine clade structure and genomic features of ST457.
(Part of this work was presented in the abstract book of the 30th European Congress of Clinical Microbiology and Infectious Diseases [ECCMID] 2020 [Abstract no. 3781; event cancelled] and as a poster at the One Health EJP Annual Scientific Meeting 2020 [online event].)
RESULTS
Phylogeny.
All 138 strains of
E. coli ST457 belonged to phylogenetic group F and carried
fimH145. Based on our single-nucleotide polymorphism (SNP) analysis, the collection of ST457 partitioned into five main clades, largely in accordance to flagella H-antigen carriage, and included H25 South, H25 North, H6, H45, and H16 and two minor clades (identified here as MC1 and MC2) (
Fig. 1; Table S1 in the supplemental material). Serogroup O11 dominated the collection (79%; 109/138), as did the H25 flagella antigen (67%; 92/138) and, unsurprisingly, serotype O11:H25 predominated (52%, 72/138). ST457 with O11:H25 was found in three out of the five main clades and it was the predominant serotype in two out of those three clades (H25 North, H25 South). Several clade-specific serotypes, including O11:H16, O11:H6, and O11:H45, were also detected. The geographical origin of the strains varied even at the clade level, including countries from North and South America, Europe, Australia, and Asia. Interestingly, even when the gull strains from Australia were excluded, four clades contained strains of Australian origin.
The H25 North clade contained 28 strains and compared to the other clades the SNP analysis showed considerable diversity (>1,500 SNPs on average) (Table S2, H25 North). However, four separate subclades (up to 20 SNPs different) were detected within the clade in three bovine strains from Luxembourg (HA9247, HA9232, and HA9243), four human strains from the United States (RA8202, RA8205, RA8209, and RA8206), two clinical strains from China (XA6422 and WA4383), and two strains from the UK (TA7464 and SA4138). Notably, the UK strains, the U.S. strains, and a single strain from Cambodia (WA7349) and Australia (FB3603) created a closely related subclade with differences between 39 to 44 SNPs (87 to 92 SNPs for the Australian strain). We would like to highlight a close phylogenetic relation between human and food-producing animal strains observed in the H25 North clade. Strain GB0145 from Brazilian swine was separated by 46 SNPs from human strain FB5883 from China, while strain PA1693 from poultry in Vietnam differed by 69 SNPs from YA1066, a human clinical strain from China.
The strains of the H25 North clade differed in ARGs and virulence-associated genes (VAGs); however, the resistance gene cargo was overall higher than in the other four ST457 clades. An average strain from H25 North clade carried twice as many ARGs as an average strain from the H16 or H6 clades, and for the H25 South and H45 clades, the ratio was even higher. All strains in the H25 North clade lacked the yersiniabactin siderophore gene cluster (
fyuA,
irp1,
irp2, and
ybtAEPQSTUX) compared to the other four ST457 clades (
Fig. 2).
The H16 clade comprised 7 strains and consisted mostly of bovine strains from the Netherlands. Strains VA5580, VA5582, and VA5574 were clonal and differed in 3 to 9 SNPs (Table S2, H16). All six Dutch strains were closely related, as the number of SNPs between the two most distinct strains was only 72. Interestingly, a single Australian human strain was part of this clade but was phylogenetically distant (approximately 3,700 SNPs) from the others in the clade. It also carried different ARGs, including blaIMP-4, and VAGs cargo and housed different plasmid replicons (Fig. S1).
The H6 clade (29 strains) was mostly of poultry origin from South and North America, in part due to the inclusion of our collection of mostly clonal Paraguayan poultry strains (n = 14; 1508e to 1527k). Surprisingly, the H6 clade contained four strains from Japan, including three human isolates. Notably, Japanese human strain QA5359 was closely related to human Peruvian strain VA5961 (69 SNPs). Moreover, Japanese human strain QA5364 showed close similarity with U.S. canine strain SA0179 (106 SNPs) and three U.S. poultry strains ZA5550 (124 SNPs), VA9753 (132 SNPs), MA1963 (146 SNPs) (Table S2, H6).
A close similarity was evident between Australian human clinical and gull strains in the H45 clade (14 strains). Gull strain CE1765 differed from the two clinical strains WA5506 and WA5541 by 21 and 33 SNPs, respectively, highlighting potential zoonotic/zooanthroponosis (reverse zoonosis) linkages with ST457 in Australia (Table S2, H45).
The H25 South clade (55 strains) of ST457 contained most of the Australian gull strains. Only five strains, gull strain CE1803A, Argentinian human strain RA2540, and three clonal U.K. human strains (GA9323, GA9192, and GA9368) displayed a large number of SNPs (>650) compared to the remaining strains in the clade (Table S2, H25 South). The other 50 strains originating from Australia, Germany, the United States, Japan, New Zealand, and the Netherlands from 2009 to 2019 differed by ≤181 SNPs. Several strains from different countries or sources were closely related within the H25 South clade (up to 80 SNPs); for example, U.S. poultry strain FB9087 possessed 76 and 77 SNPs compared to German human strain RA5006 and one of the gull strains CE1679, respectively.
Subsets of the human/clinical strains from Australia were clonal (EC0125 and EC0382, 1 SNP difference; WA5591 and WA5546, 10 SNP difference) despite being isolated from different locations (Wollongong and Sydney) or different years (2009 and 2014). Similarly, we detected many apparent clones (≤20 SNP differences) within the collection of isolates from silver gulls. Close genetic associations were also evident when comparing strains from Australian gulls with Australian human strains of ST457 (≤30 SNP differences between strains within these two categories). Human isolate WA5546 differed by up to 30 SNPs with 11 gull strains and, notably, only 10 SNPs with gull strain CE1966 (Table S2, H25 South). Of interest, ST457 strains CE1679 (gull) and IA1961 (Laughing Kookaburra Dacelo novaeguineae) isolated in Queensland showed only 20 SNP differences. The highest prevalence of ARGs and VAGs was present in 12 non-Australian ST457 strains (5.5 ARGs per non-Australian strain on average compared to 1.1 ARGs on average in Australian strains) in this clade. Phylogeny of the accessory genome showed that Australian strains (except strain CE1803A) of the H25 South clade clustered together while other strains of this clade were distant and some partitioned with different accessory clusters (Fig. S2). Interestingly, when comparing accessory genomes, all strains from H25 North clade clustered together. This was also observed for strains belonging to the H45 clade. On the other hand, all strains from the Netherlands clustered together in accessory genome phylogeny despite being part of different core genome clades and coming from different sources (Fig. S2).
Virulence-associated genes.
All ST457 strains carried diverse VAGs ranging from 43 to 75 per strain (Fig. S1). VAGs shared by all were those that play a role in heme/iron acquisition, e.g., enterobactin-related genes
fepB,
fes,
entB, or
chuY and
shuA. All strains carried genes encoding fimbria and adhesins, including
fimH or
ecpD, the outer membrane protein gene
ompA, and the invasion-associated gene
aslA. However, in four ST457 strains,
aslA was expected to be nonfunctional because of truncation (Fig. S1). Yersiniabactin siderophore genes
fyuA,
irp1,
irp2, and
ybtAEPQSTUX were detected in most strains (76%, 105/138). K antigens genes of group II, including
kpsD (96%, 132/138) and
kpsM (86%, 118/138), were found in most strains, while the capsule regions were variable. Besides
kpsM and
fyuA, the other commonly reported VAGs recognized as potential ExPEC or APEC markers (
19,
38,
88) were present in lower number in our strains, including
papC (8%; 11/138),
iroN (14%; 20/138),
hlyA (3%; 4/138), or
sfa/focD/cnf1 (1%; 1/138).
Antibiotic resistance genes.
We detected genes encoding resistance to 13 antimicrobials groups: β-lactams (90%; 124/138), aminoglycosides (54%; 75/138), sulfonamides (54%; 75/138), tetracyclines (38%; 53/138), trimethoprim (32%; 43/138), amphenicols (22%; 31/138), colistin (18%; 26/138), macrolides (17%; 23/138), fosfomycine (5%; 7/138), quinolones (4%; 6/138), carbapenems (2%; 3/138), lincosamides (2%; 3/138), and rifampin (1%; 1/138).
The five main clades were quite diverse in resistance gene cargo both qualitatively and quantitatively. An average strain from the H25 North clade carried more ARGs (13.0 genes per strain) compared with ST457 strains from H6 (4.9) or H16 (6.1) clades. The gull-associated clades H45 (1.1) and H25 South (2.6) that were prevalent in Australia typically carried limited resistance gene cargo.
Extended spectrum β-lactamase (ESBL) and AmpC β-lactamase genes were frequently identified in the ST457 strains. The most common genes of this group were blaCMY-2 (47%, 65/138), associated mostly with the H25 South, H45, and H16 clades, and blaCTX-M-1; 2; 3; 8; 12; 14; 27; 55 variants found in 46/138 strains (33%). blaCTX-M-8 and blaCTX-M-27 were the most prevalent among these variants (15/138, 11% each). blaCTX-M-8 was found mostly in Paraguayan poultry (H6 clade) and in a Bolivian human strain (H25 North), while blaCTX-M-27 was associated only with the H25 North clade with 54% (15/28) clade prevalence. blaCTX-M-55 was found with 23% (7/28) and 3% (1/29) clade prevalence in H25 North and H6 clades, respectively.
Genes encoding resistance to sulfonamides were frequently identified as follows: sul2 47% (65/138), sul1 13% (18/138), and sul3 12% (16/138). However, sul2 was more prevalent in H25 North, H16, and H6 clades, where it was detected in 76 to 86% of ST457 strains. sul3 was found only in the H25 North clade in 57% (16/28) of strains, for which only five of them were clonal.
Carbapenemase genes were found sporadically in three strains only. Specifically, we identified
blaIMP-4 in an Australian human strain,
blaNDM-9 in a poultry strain from China, and
blaOXA-23, which was located on the chromosome of CE1628, from the Australian gull collection (
Fig. 3).
Plasmid-mediated resistance to colistin was represented by carriage of mcr-1.1 or mcr-2.1 in 46% (13/28) of strains in the H25 North clade. In the H6 clade, the occurrence of mcr-5.1 (45%, 13/39) was high but, in contrast to the diversity of strains in the H25 North clade, was due to clonal spread on Paraguayan farms.
Overall, we detected 10 strains, mostly from the H45 clade, which did not carry any ARGs. In contrast, ST457 strain HA3078 from a poultry source in China carried 29 resistance genes and 9 plasmid replicons, indicating that ST457 can carry extensive resistance gene cargo on diverse plasmids.
The Australian gull and human collections from H25 South and H45 clades were quite unified regarding resistance gene content. Most of these strains carried only the plasmid-borne gene blaCMY-2; however, CE1887 gull strain carried 13 resistance genes, including blaCMY-2 and 7 plasmid replicons.
blaOXA-23 and carbapenemase-related phenotypic characteristics.
A region of DNA from the chromosome of gull strain CE1628 containing
blaOXA-23 (GenBank
MT468652) carried multiple mobile elements and other resistance genes, including
sul2,
aph(
3′)-
Ib,
aph(
6)-
Id, and
aph(
3′)-
Ia, and a mercury resistance operon (
Fig. 3). Using BLASTn, we did not find a full match of this region with sequences deposited in public databases. The genetic structure suggests it is comprised of separate regions derived from apparently unrelated sources and its path to formation remains unknown. While the region spanning
blaOXA-23 is typical of those observed in
Acinetobacter spp., subregions of it have been seen within
E. coli plasmid
KJ716226 (
39) (
Fig. 3).
Strain CE1628 was positive for carbapenemase activity using a matrix-assisted laser desorption ionization (MALDI) assay, however, the activity was low. MIC values showed the strain was resistant to five extended-spectrum β-lactam antibiotics or their combination with β-lactamases inhibitors as follows: ampicillin (MIC > 128 mg/liter), ampicillin-sulbactam (32 mg/liter), cefuroxime (64 mg/liter), cefotaxime (>8 mg/liter), and cefprozil (64 mg/liter). The strain was susceptible to meropenem (<0.125 mg/liter) and intermediate to ertapenem (1 mg/liter) (Table S3). In contrast to our findings, a previously described
E. coli strain carrying the
blaOXA-23 region on plasmid
KJ716226 showed higher MIC values for carbapenems: meropenem 4 mg/liter (resistant) and ertapenem 6 mg/liter (resistant) (
39). MIC values for all tested antibiotics and their interpretation using CLSI breakpoints (
40) can be found in Table S3.
Plasmids carried by E. coli ST457 and their transferability.
ST457 strains carried various plasmid replicons, including different Col-type replicons, C, B/O/K/Z, FIA, FIB, FIC, FII, HI1A, HI1B, HI2, I1, I2, L/M, N, X1, X2, X4, Y, and p0111. F plasmids (61%; 84/138), particularly FIB (49%, 68/138), I1 (59%; 81/138), and I2 (18%; 25/138) predominated among large plasmids (Fig. S1). While I1 plasmids were prevalent mostly in the H25 South (82%, 45/55), H45 (71%, 10/14), and H6 (62%, 18/29) clades, F plasmids were common in the H25 North (89%, 25/28), H16 (86%, 6/7), and H6 (83%, 24/29) clades. I2 plasmids were frequently identified in H25 North (36%, 10/28) and H45 (29%, 4/14) clades.
Previously, we demonstrated transfer of I1/ST113 plasmids carrying
blaCTX-M-8 and
mcr-5 using Paraguayan poultry strains as a source to recipient
E. coli strains via conjugation (
31). Using representative ST457 from gulls, successful conjugal transfer was achieved in 9 of 13 strains that carried I1 plasmids with
blaCMY-2 and in a single strain carrying an F18:A-:B1 plasmid with
blaDHA-1.
I1 plasmids.
I1/ST23 plasmids disseminating
blaCMY-2 showed high prevalence in Australia, as we detected their presence in most of the ST457 gull strain collection (81%; 34/42). Only one gull strain did not carry an I1 plasmid, whereas the rest carried I1-
blaCMY-2 plasmids with novel ST315 (10%, 4/42) and ST314 (7%, 3/42) sequence types. From 12 Australian human and clinical strains in the H25 South and H45 clades, 67% (8/12) carried I1/ST23 plasmids. This plasmid was also detected in Australian avian strain IA1961 (
Dacelo novaeguineae). The complete sequence of a 94,698-bp I1/ST23 plasmid was generated here from
E. coli strain CE1628 by combining PacBio and Illumina data and performing a hybrid assembly using Unicycler (GenBank MT468651). The polished sequence was used as a reference for BRIG analyses (
Figure 4a) and showed that closely related I1/ST23 plasmids are shared by a subset of human and gull strains (
Fig. 4). Not all gull plasmids were identical, but all of them encoded the same resistance structure comprising
blaCMY-2 and
sugE (resistance to quaternary ammonium compounds) flanked by IS
1294 (
Figure 4b). The region spanning
blaCMY-2 comprises 6,053 bp and showed 100% sequence identity with previously described plasmids with I1 and C replicons (
41,
42). An I1/ST265 plasmid (pS10584; 94,697 bp) in
Salmonella enterica strain SJTUF10584 isolated in Shanghai, China showed 99.9% sequence identity with our gull reference plasmid using BLASTn.
Figure 4, which depicts pS10584 and selected I1/ST23 plasmids sourced from wild birds and humans, shows that identical plasmids were found in
E. coli strains from human and gull sources plus
S. enterica strain SJTUF10584. A BRIG comparison of all ST23, ST314, and ST315 plasmids that carried the same
blaCMY-2 region brings the total to 51 related plasmids (Fig. S3).
We previously described the dissemination of I1/ST113 plasmids carrying
blaCTX-M-8 and
mcr-5 in Paraguayan farms (
31) but we did not detect I1/ST113 plasmids in other strains from the ST457 collection.
I1/ST12 plasmids carrying blaCMY-2 were found only in five strains of ST457, but while other I1 plasmids were strictly country associated, this plasmid sublineage was found in different clades of ST457 internationally, including in strains from the United States (poultry), China (poultry), Brazil (swine), and Germany (human).
F plasmids.
We found F replicons (FIB, FII, FIC, FIA) in 61% (84/138) of the ST457 strains. Carriage of these plasmid replicons was dominant in the H25 North, H6, and H16 clades. We noticed that several combinations of F replicons were repetitive (
Fig. 2). F29:A-:B- plasmids (
n = 9) were detected in a subset of Australian gulls, all of them from the same location (Montague Island). F64:A-:B27 (
n = 14) was related to Paraguayan poultry. Interestingly, F2:A-:B1 (
n = 11) plasmids were more related to the country of their origin than with clonal relationships of the strains that carried them, as they were found in all strains from the Netherlands (bovine and human origin) even if they belonged to different clades (H25 South and H25 North). Moreover, F2:A:B1 plasmids were detected in two poultry strains from the United States (H6 and H25 South) and a human strain from Japan (H6).
F24:A-:B1 plasmids (n = 15) were prevalent in the H25 North clade (50%; 14/28) and found in one Japanese strain from a human in the H6 clade. A similar situation was observed for F18:A-:B1 plasmids (n = 10), which were prevalent in the H25 North clade (29%; 8/28) but present as well in H6 and H25 South clades, with one identified in a Japanese strain (water source) and one detected in a gull strain isolated in Australia.
ST457-specific chromosomal regions of difference.
We detected 141 regions of difference (RODs) in the ST457 chromosome (referred to as AA to FK, Table S4 and Table S5) which were shared by all three completed ST457 genome sequences and absent in E. coli K-12. Ninety-six of these 141 RODs were detected in 95% of our ST457 strains, suggesting they may play a role in the expression of features unique to E. coli ST457 (Table S4). RAST identified more than 300 proteins in shared RODs and many of these had functions involved in toxin/antitoxin systems, including RelE-like proteins (AC, AR, and CZ), HicAB-like proteins (AG), HigB (DS), RelB (BR), and ParDE (CN). Two RODs were associated with heme uptake (AI and ED), many encoded putative adhesins (AW, CJ, CR, DR, EP, and FI) and general secretion pathway proteins (FC). All ST457 strains also possessed a region (DY) for efflux of antibiotics and RODs with MFS-type transporters (EI, EJ, and EK). Notably, however, approximately 36% (35/96) of all the shared RODs carried genes encoding proteins which seemed to be related to metabolism and substrate uptake and transport across cell membranes. These included proteins for metabolism of sucrose (EW), cellobiose (DP), arabinose (AA and EY), galactose (AA), xylose (AA, AK, and BX), 3-phospho-d-glycerate (EY), serine (AM, BU, and EI), histidine (CW), aspartate (CW and ER), N-acetylgalactosamine (FJ), and hydroxyaromatic compounds (EU).
DISCUSSION
Tracking diverse
E. coli sequence types is becoming critically important in the One Health understanding of disease epidemiology, particularly concerning the zoonotic and reverse zoonotic transfer of pathogens and antimicrobial resistance. Here, we isolated and performed whole-genome sequencing on 42
E. coli ST457 strains from Australian gulls, 2 Australian clinical strains, and 14
E. coli ST457 from chickens in Paraguay, then utilized the EnteroBase database where we recovered 80 ST457 WGS to determine their phylogenetic relationships to one another from a global context. EnteroBase currently places the oldest strain of ST457 to 2001, in an Australian wild bird. The EnteroBase ST457 strains have increased from 176 to 219 in 5 months, since we began the analysis on 30 October 2019. EnteroBase records demonstrate the global expansion of this lineage and indicate ST457 can be isolated from diverse sources, including healthy and sick humans, poultry, cattle, swine, wild animals, companion animals, water, and food. ST457 is geographically disseminated, having been isolated from countries including Australia, New Zealand, China, Japan, Vietnam, Cambodia, the United States, Argentina, Bolivia, Mexico, Ecuador, Peru, Brazil, Germany, the United Kingdom, Luxembourg, France, Italy, the Netherlands, and Spain (
http://enterobase.warwick.ac.uk/).
Through the generation of a global phylogeny, we identified five primary clades for
E. coli ST457. Three of these clades contained strains from our studies, clustering with other sequences in EnteroBase. We observed several cases of clonal spread defined (≤20 SNP differences in the 3,756 core genes), including strains of bovine origin from Luxembourg, human strains from the United States, clinical strains from China, human strains from the United Kingdom, bovine strains from the Netherlands, poultry strains from Paraguay, gull strains from Australia, and human and clinical strains from Australia. We would like to especially highlight the sharing of the same clones of ST457 among an Australian gull and human/clinical strains that was observed in three cases (Table S2, H25 South). Our data suggest that a clonal lineage of ST457 can occupy a different niche in New South Wales, Australia and implies gulls may be an environmental reservoir and transmission route for human pathogens. Whether our data supports evidence for zoonosis, zooanthroponosis, or both is uncertain. Gulls from Five Islands are known to visit municipal waste sites in the Wollongong region of New South Wales (
43). Australian silver gulls were previously identified as carriers of virulent and antibiotic-resistant human-associated
E. coli clones (
44). Interestingly, a wild bird (
Dacelo novaeguineae) ST457 strain coming from Queensland is closely related to one of our gull-sourced strains, which highlights that such strains may be further distributed in the environment across long distances via the flight behavior of the birds that carry them.
Moreover, we have detected closely related (≤80 SNPs) human strains of ST457 sourced from very distinct countries, e.g., Japan/Peru; United Kingdom/United States/Cambodia; New Zealand/the Netherlands; Germany/Japan; New Zealand/Germany; and the Netherlands/Australia (clinical). Similarly, closely related isolates were observed from diverse sources and countries as Brazil (swine)/China (human); China (poultry)/Vietnam (clinical); United States (poultry)/German (human); United States (poultry)/Australia (gull); United States (poultry)/Australia (human); United States (water)/Japan (human), or New Zealand (human)/United States (water) (Table S2, H25 North, H25 South, H6). These examples of highly related strains being sourced from such distinct environments highlight the international and cross-environmental transmission of microbes, and is also an indication of the capacity of
E. coli ST457 to inhabit multiple and diverse niches and acquire mobile genetic elements from those niches. Similarly,
E. coli strains belonging to STs which commonly cause ExPEC diseases show zoonotic potential. Some have been detected even within the most distinct environmental niches, including ST131, ST73, ST648, or ST95 in Antarctic pinnipeds (
19). Moreover, a lack of specific host-adaptation markers was shown previously for ExPEC, such as ST131 or ST648, which suggests they are able to survive and proliferate in diverse hosts and are not specifically human-adapted (
45). The phylogenetic analysis of the accessory genome of ST457 showed that accessory clades are in relatively good concordance with core genome clades and do not demonstrate niche-based adaptation (Fig. S2). This suggests at least some clades or subclades (H45, H25 North, Australian part of H25 South) are quite conserved and well-developed, which is interesting for this seemingly young lineage and in contrast to observations in ST648, for which accessory genome to core genome clade clustering was limited (
45). Nevertheless, further sampling is needed to shed light on the distribution and evolution of ST457.
The frequently observed VAGs in ST457 encoded a range of functions suspected to play roles in cell survival during extraintestinal infections, including siderophores and heme-acquisition proteins, adhesins, capsules, and fimbria. Capsule antigens play crucial roles in many host pathogen interactions, including ExPEC, as they provide protection against complement-mediated killing (
19,
46) and influence niche colonization (
47). Genes encoding the siderophore enterobactin seem to be related not only to iron uptake but they may play other physiological roles, including responses to oxidative stress (
48). Another siderophore, yersiniabactin, was previously described to contribute to the virulence of
E. coli and possibly to copper resistance (
46), and its frequent association with UPEC strains even led to attempts to use it as an antigen for vaccines against UTIs (
49). Interestingly, the yersiniabactin gene cluster was not present in the H25 North clade, which otherwise seems to be a progressive clade with a propensity to carry resistance gene cargo. The reason for lacking the yersiniabactin is unclear; pathogen evolution is complex and the acquisition of virulence attributes by commensal or otherwise less-pathogenic lineages, as suggested by the acquisition of ColV plasmids and genomic islands (
50), represents one path (
51–53). On the other hand, loss of genomic DNA represents another path to pathogen evolution. Some
E. coli may undergo a process of genome reduction to become more host-adapted and be less obvious to the host immune system (
46).
One of the main factors which determine whether a bacterium will be successful in a specific niche is reflected in its metabolism capacities (
46). ExPEC strains are capable of utilizing substrates not typical of commensal flora (
54,
55). Some studies propose that the capacity for rapid growth linked to metabolic capacity should be considered a virulence attribute (
56,
57). We detected genes in all ST457 strains that were absent in
E. coli K-12 that may to be related with utilization of three (3-phospho-
d-glycerate, cellobiose, and
N-acetyl-
d-galactosamine) out of four nutrients predicted by Monk et al. (
54) to give ExPEC strains a catabolic advantage. Further studies are needed to investigate these preliminary observations.
A large-scale study focused on identifying resistance to critically important antibiotics in silver gulls in Australia revealed that 22% of the samples were positive for
E. coli isolates resistant to extended-spectrum cephalosporins (
44). Previous research focused on phylogenetic group F in Australia identified that some STs are more likely to be fluoroquinolone resistant (ST354, ST648, and ST3711) than others (ST59 and ST62) with unclear reasons as to why (
58). Two ST457 wild bird strains described in that study were included in our collection (IA1961, referred to as B1700, and IA1962, referred to as B093) (
58). Our results showed ST457 carried quinolone-resistance genes only at low frequency, while appearing better adapted to carrying ESBL/AmpC genes. Most of the Australian ST457 strains sourced from gulls and humans in our study were not MDR, carrying only
blaCMY-2, but they were widespread, representing the most common chromosome type isolated from these gull samples. A recent study focusing on ESBL-producing
E. coli strains from a hospital in Taiwan showed that ST457 caused 4% of UTIs (3rd after ST131 and ST38; equal to ST69 and ST405), 5% of primary bacteremia (4th after ST131, ST38, and ST405), and 8% of biliary tree infections (2nd after ST131; equal to ST38, ST12, and ST10) (
38). Another study from a hospital in Nigeria with a nonbiased selection of ExPEC strains from urine, blood, and intraabdominal wounds identified ST457 as the second most prevalent type (20%; 12/60), and all of these strains carried
blaCTX-M-15 (
18). These findings reveal ST457 is one of the major ESBL-connected
E. coli pathogens in Taiwan and in Nigeria, two countries with no known ST457 representatives in EnteroBase. These data suggest that ST457 is an emerging
E. coli ST that is able to cause ExPEC disease and occupy diverse niches in humans, food animals, wild birds, and water. Moreover, ST457 seems to benefit from carriage of ESBL/AmpC genes. Whether this advantage lies specifically in the carriage of the resistance genes or from carriage of plasmids with genetic cargo that confers resistance to these antibiotics and other unknown traits requires further investigation. As these ESBL/AmpC genes are often carried on plasmids, it is conceivable that the capacity to acquire plasmids carrying these resistance genes plays a role in the success of ST457 and other pathogenic lineages.
Based on general findings and our observations, we focused on the two most common plasmid families, I1 and F plasmids, which are known to disseminate ESBL/AmpC genes (
59–61). Successful conjugation experiments were performed with I1/ST23 and F18:A-:B1 plasmids from the gull collection in this study and previously with I1/ST113 plasmids from the Paraguayan collection (
31). While we could not identify a single plasmid lineage which would be related to ST457 in general, we observed several country or country/source specific connections: (i) I1/ST23, I1/ST314, and I1/ST315 disseminating
blaCMY-2 in Australia; (ii) I1/ST113 carrying
blaCTX-M-8 and
mcr-5 in Paraguayan poultry; and (iii) F2:A-:B1 plasmids of Dutch origin appearing across multiple ST457 clades.
We investigated the spread of I1/ST23 plasmids carrying
blaCMY-2 among Australian strains, including gull, human, and clinical isolates. Comparisons of the sequences showed that identical plasmids are shared by isolates from gulls and humans, directly linking the microbiomes in Australia. Interestingly, these plasmids of identical structure were found predominantly in gull isolates from Sydney, with some also present in gulls from Five Islands but none from Montague Island, the most distant from large human populations of the three sites (Fig. S3,
Fig. 2). This proximity of gulls to humans in urban areas is the likely contributing factor here, indicating that the gull gut microbiome can acquire β-lactam-resistant
E. coli from environments where extended-spectrum β-lactams are used. These observations concur with those from a previous study in Alaska that showed that, in the case of antibiotic resistance, the proximity of the gull population to urban areas is a relevant factor influencing the level of resistance found in the gull microbial population (
62).
Moreover, we identified a
Salmonella strain SJTUF10584 from GenBank carrying a highly similar plasmid (100% query and 99.9% identity compared to the CE1628 I1/ST23 reference), yet with a different plasmid sequence type 265 from Shanghai. A comparison of plasmid sequences from previous studies which described a
blaCMY-2 region with 100% identity with our reference I1/ST23 plasmid suggests the region may have been transferred to I1 plasmids from a C plasmid (formerly A/C) (
41,
42). Moreover, this complex resistance structure was present in two novel I1 plasmid sequence types described in this study, I1/ST314 and I1/ST315. I1/ST23 plasmids are known to disseminate
blaCMY-2 in American crows (
36). Moreover, I1/ST23-
blaCMY-2 was detected in six strains from a children’s hospital in Seattle, with one of the strains typed as
E. coli ST457 (
63). These observations lend weight to the hypothesis that our phylogenetic inferences of
E. coli ST457 show the impact of this lineage reaching beyond Australia, despite its apparent prevalence there.
We observed four Japanese human and environmental strains in the H6 clade, which was otherwise related to isolates from poultry of North and South American origin. A previous study from Norizuki et al. (
32) suggested that the importation of South American chickens may have enhanced the spread of
blaCTX-M-8 on I1/ST113 or closely derived plasmids in Japan, with the detection of closely related plasmids from the imported chickens and the Japanese population. We observed a high prevalence of I1/ST113 plasmids carrying
blaCTX-M-8 and
mcr-5 in our chicken samples from Paraguay. Further investigation that examines phylogenetic relations between those I1/ST113 plasmids is needed to interrogate this hypothesis.
Plasmids found in
E. coli with similar backbone sequences have a remarkable capacity to acquire drug resistance genes that reflect the antimicrobial selection pressures to which their host
E. coli are exposed (
52,
64,
65). Plasmids are also able to move about (
66) within an animal species, between humans and animals (
52), and between
E. coli and
Salmonella enterica (
64). From the avian collection here, we detected some clearly successful plasmid lineages. This includes I1/ST12, which is known for the global dissemination of
blaCMY-2 in various environments (
36,
60,
67). Most of the detected F replicons were previously reported from diverse sources, in diverse
E. coli plasmid STs, e.g., F24:A-:B1 or F18:A-:B1 (
19,
61). However, F18:A-:B1 plasmids of approximately 226 kbp carrying eight resistance genes encoding resistance to six antibiotic classes, including
mcr-1 and
blaCTX-M-55, were previously described in ST457 (
16). Plasmids with this replicon were present in one gull strain and several human strains in our study.
Notably, we detected a
blaOXA-23 carbapenemase-encoding gene on the chromosome of an ST457 isolated from a silver gull. To the best of our knowledge, this is the first report of this carbapenemase gene in
E. coli from a wild bird, as it is typically present in
Acinetobacter spp. and generally uncommon in
E. coli (
39,
68). The strain showed low carbapenemase activity and its MIC values for carbapenems were much lower than in the previously described plasmid-encoded
blaOXA-23 region detected in a clinical
E. coli isolate (
39). As the plasmid region of 2,577 bp described in GenBank accession
KJ716226 (
39) was identical to ours (
Fig. 3), we may assume that broader gene context may influence gene expression.
In summary, this study provides insights into E. coli ST457, a lineage of E. coli that displays a propensity to acquire and retain plasmid cargo carrying resistance to clinically important antibiotics and cause extraintestinal pathogenic E. coli (ExPEC) disease. ST457 was the most frequent ESBL-carrying E. coli ST recovered from Australian silver gulls sampled from multiple islands off the coast of New South Wales in 2012. A comparison of all ST457 WGS available in public repositories with those generated in this study enabled us to readily identify shared clonal lineages in gulls and humans in Australia and worldwide, indicating that in some cases gulls may be an important vector for the transmission of extended-spectrum β-lactam resistant ST457 that is capable of causing ExPEC disease. Our analyses show that: (i) plasmids carried by E. coli ST457 play an important role in the carriage and dissemination of resistance to clinically important antibiotics, particularly ESBL/AmpC genes, and possibly mcr genes, and a conjugative I1/ST23 plasmid was completely sequenced here; (ii) ST457 may be emerging as a pathogen in humans given its recent description in several hospitals linked with ExPEC disease in countries in both the northern and southern hemispheres; (iii) ST457 has been isolated from diverse mammalian and avian hosts and from the environment, underscoring its metabolic capacity; and (iv) plasmid acquisition has played an important role in the evolution of E. coli ST457.