INTRODUCTION
Rising antimicrobial resistance (AMR) poses a major threat to public health globally (
1,
2). Antibiotic resistance impairs our ability to adequately treat infections and is associated with increased morbidity and mortality (
1). Although broad-spectrum antibiotic use could improve the reliability of coverage, it also promotes the selection of antibiotic resistance to these agents, which can lead to further spread in antibiotic-resistant organisms (AROs) (
3). Rapid diagnostic approaches offer the promise of both improving the reliability of antibiotic coverage while also potentially narrowing the spectrum of initial antibiotic use. Despite this clear benefit, the advancement and clinical integration of rapid diagnostic methods for use in clinical care has been slow (
4).
Our current methods of diagnosing infecting pathogens are still largely culture-based, which are both time- and resource-intensive (
5). Healthcare providers must rely on empiric antibiotic therapy for the initial management of many infectious syndromes that carry the risk of either being unnecessarily broad spectrum or conversely inadequate for the underlying pathogen(s). Urinary tract infections (UTIs) have been significantly impacted by AMR, with the likelihood of activity for many first-line therapies declining over the decades due to rising AMR (
6). This can have significant downstream effects on human health globally, given it is one of the most common community-acquired bacterial infections and can also manifest as severe disease including septic shock (
7,
8). Urine specimens take approximately 36 h for the microbiology laboratory to complete pathogen identification and phenotypic antibiotic susceptibility testing, often following 12–24 h or more of transiting the specimen to a laboratory (
9). From the time of collection, it can take up to 4 days for a patient with a UTI to receive their result (
10). Delays in therapy, particularly in critically ill patients, can lead to increased morbidity and mortality, as well as patients receiving overly broad or ineffective antibiotics (
11). Developing methods, which can give providers more actionable information sooner, could help improve their empiric therapies by identifying highly resistant organisms (in particular those resistant to first-line agents), as well as permitting the reconsideration of second-line antibiotics that had previously been abandoned due to low likelihood of activity (
12).
Recently, much effort has been placed on sequencing-based methods for accurately assessing the microbial cause of illness and antibiotic susceptibility phenotypes with the aim of developing culture-free diagnostics (
4,
13). However, these methods often rely upon lengthy- and resource-intense gene-based alignments to reference databases to identify potential resistance genes and profiles (
4,
14,
15). Moreover, the presence or absence of a given resistance gene is insufficient to fully assess the resulting AMR phenotype, possibly through interactions with other resistance determinants, inducible resistance, or previously unrecognized mechanisms of resistance (
16,
17). More recently, developments in
k-mer-based strategies have shown that the analysis times required to match sequenced samples against curated databases may be greatly reduced (
18). This method matches smaller portions of genetic material against
k-mer databases made from the assemblies of known isolates with their associated antibiotic susceptibility profiles, meaning it can be made to run much faster than traditional gene-matching methods (
18). The RASE method relies on neighbor typing algorithms, which predict both the (
1) phylogroup (or sequence type) and (
2) susceptibility patterns for unknown samples by utilizing the link between phylogeny and phenotype. Prior work has retrospectively evaluated the potential applications of RASE for
Streptococcus pneumoniae and
Neisseria gonorrhoeae (
19,
20) but has not been prospectively evaluated for the most common Gram-negative pathogens
Escherichia coli and
Klebsiella spp.
Pairing sequencing of primary urine specimens with fast neighbor typing algorithms could be used to rapidly generate information about a pathogen’s likely antibiotic susceptibility phenotype to guide empiric treatment until the availability of gold-standard culture-based results. Urine samples are particularly well suited for this approach, given the often high bacterial loads present and the potential for impacting current care practices.
In this multicenter prospective study, we demonstrate the use of ONT MinION direct metagenomic sequencing, paired with k-mer-based neighbor typing to rapidly predict the multilocus sequence type (MLST) and susceptibility profile of typical uropathogen(s) identified in suspected urinary tract infection.
DISCUSSION
In this study, we evaluate a rapid diagnostic approach of paired metagenomic sequencing with neighbor typing to predict MLST and antibiotic susceptibility phenotype across two important Gram-negative pathogens (
19) in urine samples. Using this approach, we were able to improve the confidence of empiric antibiotic susceptibility predictions over the local antibiogram. In particular, we demonstrate that this method has the potential to support the reconsideration of clinically relevant but less commonly used antibiotics for use prior to definitive phenotype identification by moving their likelihood of activity above treatment thresholds and identifying infections with a high likelihood of non-susceptibility that would require broader spectrum antimicrobials.
Prior studies using metagenomic approaches for rapid antibiotic resistance prediction have shown that susceptibility and non-susceptibility can be inferred within 10 min of sequencing for both
Streptococcus pneumoniae and
Neisseria gonorrhoeae using the RASE algorithm (
18). Other work linking metagenomic approaches to antibiotic resistance prediction has included mapping reads to specific genomes or collections of antibiotic resistance genes and showed some success (
26–28). However, these methods often require matching to specific genes or rely on unsupervised approaches, and linking genotype to phenotype can be challenging (
29). Methods using machine learning and artificial intelligence are also on the rise, which also works to link genotypic information with phenotypic data, and these methods have shown some success in predicting AMR phenotypes (
30–33). With this study, we show neighbor typing combined with metagenomic sequencing has the potential to provide rapid and informative predictions of antibiotic resistance in the two most common Gram-negative pathogens,
E. coli and
Klebsiella.
The approach used in this paper is rapid with necessary sequencing times of typically less than 30 min. We can determine the expected total workflow duration based on these sequencing times as well as the typical sample processing and library preparation times. Using the methods described herein, it typically takes 90 min to pre-process and deplete eukaryotic DNA, 60 min to extract prokaryotic DNA, and 3 h for library preparation. Taken together, this means results can be available within 6 h of sample collection (excluding transit times). Of this total time period, the first 5.5 h require some degree of hands-on time, although this could be reduced (
34–36). One present limitation lies in the specialized training and familiarity with bioinformatics workflows needed for completing and interpreting analyses. However, these processes can be automated and simplified for ease of use. As well, the requirement of batching samples to achieve economy of scale is a limitation to the rapidity of analyses. However, this could be overcome through the use of the Flongle system (ONT), which can be used as a single sample disposable platform at a similar cost per run to minION flow cells (a single Flongle flow cell currently costs approximately one–tenth of the price of a single standard flow cell).
Information that can significantly improve empiric antibiotic selection (currently based on local antibiograms) can be imperfect yet still benefit decision-making (
22). Although microbiologic diagnostics have historically produced dichotomized results (e.g., sensitive vs. resistant), recognizing that even these have their own critical limitations, new technological advances support the use of alternative data streams that can provide probabilistic predictions at faster timelines (
14,
37). Further work is needed to find how this information can be best incorporated into clinical decision-making.
We evaluated how stratification by two RASE-produced metrics might improve the confidence in calls made by RASE. Although not statistically significant, stratification by SS, LS, or LS + SS may provide some improvement to the predictions for both
E. coli and
Klebsiella spp., with some benefit to likelihood ratios, sensitivities, and specificities, but at a cost of significant loss of the proportion of specimens with predictions. In keeping with this, for
E. coli, concordance improved as LS improved, which fits with phylogroup-based prediction and supports the notion that improving the ability to identify a related strain within the reference database (in this case of the same MLST) improves the ability to predict antibiotic susceptibility. Although a perfect relationship between MLST and relatedness is not expected, MLST (using the traditional seven loci) has reliably been used for the identification of genus, species, and lineage/clonal complexes (
38). However, for the distinction between highly related clones or strains, a higher resolution is necessary to distinguish relatedness and may require other classification methods (
38,
39). Interestingly, for
Klebsiella spp. this did not clearly hold. Although neighbor typing for
Klebsiella spp. improved calls for susceptibility/non-susceptibility, it may be independent of MLST, though it is hard to definitively conclude this with the existing sample size for this genus.
This method of neighbor typing and antibiotic susceptibility prediction fundamentally relies on the databases that are used for making these predictions. Our results using a European (EuSCAPE) database showed that performance was similar to that of the regional database, whereas local databases seemed to perform no better, meaning that in this case, databases from other continents could be used to make predictions. This shows that the physical proximity of database samples may matter less than having a diverse reference database and having samples that are genetically proximate. Combined with the results from the EuSCAPE database, this highlights the need for large, well-structured reference databases with a diversity of isolates to make better concordant matches.
There are several strengths to the approach outlined in this paper. The proposed method can be fast, with the potential for informative results to be generated in as little as 6 h. Rapid turnaround of samples shows this approach could meaningfully impact on empiric management of infections, with the potential to improve patient outcomes and reduce the reliance on broader-spectrum antibiotics. Moreover, this method could be adaptable for other approaches beyond susceptibility prediction, including rapidly ruling out potential transmission. Finally, based on our evaluation using an international database, we show that this method may also be implementable globally with large databases.
There are limitations to the work we present here. First, most of our results are based on a single geographic region (Ontario, Canada), and further study will be required to determine utility elsewhere. We also have a limited number of samples presented here, and only from two genera of
Enterobacterales; hence, our results are underpowered for detecting statistically significant (or insignificant) differences in the test characteristics following stratification. Also, this method currently cannot quantify the pathogen for significant growth, as we are unable to relate sequencing results to the quantitation of colony-forming units (CFU). Moreover, 9.1% of our remnant urine samples were excluded from the analysis because the expected uropathogen was not present. This could produce inconsistencies between what was reported in the culture report and the true sample and could fail to detect a pathogen that was present in the sample at low titers. However, these limitations reflect ongoing challenges with the diagnosis of UTIs using culture-based techniques (
9). Finally, one concern of the RASE method is how it accounts for mobile genetic elements, such as plasmids. AMR genes are commonly found on plasmids in
E. coli and
Klebsiella spp. (
40). Presently, RASE is unable to distinguish if there is a presence or absence of plasmids in a genome, as well as if there has been any plasmid recombination or partial integration with the chromosome. However, whether plasmids are present is still relevant to the RASE method. That is, some lineages of bacteria are better at accepting and retaining plasmids due to the impact that plasmid acquisition can have on the fitness of the host (
41). If lineages have been well-sampled, there should be lineages captured with and without AMR plasmids with resistance phenotypes, and this risk of resistance should be reported by the susceptibility score produced by RASE (
18). Although the presence or absence of AMR plasmids is not explicitly accounted for in this work, sampling of various lineages, including several isolates from the same lineage, should capture a broad view of samples with and without plasmids and should assess some level of the risk for resistant phenotypes from this sampling.
In conclusion, we show that combining metagenomic sequencing with neighbor typing algorithms has the potential to support rapid and informative predictions of the susceptibility of dominant pathogens present in urine samples. Current prediction performance would benefit from future optimization to further improve potential clinical impacts. Additional work is needed to evaluate approaches to accurately quantify absolute bacterial load in metagenomic specimens and prospectively evaluate impacts on UTI treatment.