INTRODUCTION
Newborns admitted to neonatal intensive care units (NICUs) are at high risk of contracting health care-associated infections (HAIs) due to the immaturity of their immune systems and the intensity of medical interventions required.
Serratia marcescens is a Gram-negative bacterium that is commonly associated with HAI outbreaks in NICUs. In a recent review of the literature on NICU outbreaks,
S. marcescens was associated with 5 of 39 outbreaks (
1), with a broad clinical spectrum, including septicemia, conjunctivitis, pneumonia, urinary tract infections, and meningitis.
S. marcescens is ubiquitous in the environment, and previously identified sources of contamination included the hands of health care workers, medical devices, soap, sinks, and milk (
2–4). Although environmental sampling occasionally led to the isolation of
S. marcescens, the source of contamination remained unidentified in most outbreaks (
5,
6). In NICUs,
S. marcescens gastrointestinal carriage was identified as a potential reservoir (
5,
7) and was associated with greater prematurity, use of antibiotics, and mechanical ventilation (
7).
Previous investigations showed that generally more than one clone can be identified during
S. marcescens outbreak investigations (
2,
5,
8,
9). Typing of bacterial isolates can be particularly challenging and relies on the use of molecular biology tools such as pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and ribotyping. PFGE is the more commonly used typing technique and was applied previously to the investigations of
S. marcescens outbreaks in NICUs (
2,
10,
11). Although PFGE is still the gold standard for most bacterial species, whole-genome sequencing (WGS) is a promising and increasingly used tool for strain typing, often providing higher resolution than other tools (
12–14).
Other recently developed next-generation sequencing (NGS) tools, such as targeted amplicon sequencing of bacterial phylogenetic marker genes, could improve outbreak investigations, allowing investigators to profile the microbial community composition in environmental samples and improving our understanding of microbial spread and colonization of the hospital environment (
15–17). In this study, we describe how the use of advanced molecular biology tools, such as bacterial WGS and sequencing of amplicons from environmental DNA, is challenging but can provide crucial information in the context of a NICU
S. marcescens outbreak.
DISCUSSION
NGS tools have great potential for use in clinical microbiology and outbreak investigations (
40,
41). Here, WGS of bacteria and amplicon sequencing of environmental samples were applied to the characterization of a NICU
S. marcescens outbreak. WGS was mainly used for strain typing, using a core genome hqSNV bioinformatics approach (
19). hqSNV was used previously for typing of other members of the
Enterobacteriaceae family (
12,
42,
43), as well as other groups of bacteria (
13,
14,
44), but this is the first report, to our knowledge, of its use for the investigation of a
S. marcescens outbreak. Good correlation was found between PFGE and hqSNV results, with between 0 and 5 SNV positions being identified for isolates sharing the same PFGE profile; higher numbers of SNVs were identified for isolates with distinct PFGE patterns. Previous studies found that the numbers of SNVs among isolates from the same outbreak can vary depending of the bacterial taxon analyzed. Using the same bioinformatics approach, the numbers of SNVs identified among isolates from the same outbreak varied from 0 to 4 for
Salmonella enterica serovar Heidelberg (
12), from 0 to 3 for
Salmonella enterica serovar Enteritidis (
45), from 0 to 5 for
Escherichia coli O157 (
46), and from 0 to 2 for
Vibrio cholerae (
47). As reported here, the use of a SNV density filtering step at different cutoff values has a strong effect on the number of hqSNVs identified among clusters (
19). Although the supporting epidemiological information and the number of isolates tested were not sufficient to define the SNV density and the numbers of core hqSNVs that should be used as cutoff values to identify isolates belonging to an outbreak, the results showed the clear potential for using WGS for
S. marcescens typing.
Both PFGE and core hqSNV identified multiple clones during this NICU
S. marcescens outbreak investigation, a phenomenon reported previously (
2,
5,
8). Strains isolated from patients belonged to 5 different PFGE banding patterns and hqSNV clusters, but only 1 of those was identified in more than 1 patient (PFGE pattern A). Similarly, implementation of
S. marcescens screening procedures in the NICU of a German hospital led to the identification of isolates corresponding to 8 different PFGE patterns over a 3-month period, with 6 of the patterns occurring in the same unit and 1 dominant strain leading to multiple cases (
8). These results indicate that multiple sources could be involved in
S. marcescens contamination of hospitalized neonates and that screening procedures implemented during outbreak investigations might have detected
S. marcescens strains that would have gone undetected under normal conditions.
Environmental sampling led to the isolation of 2 strains from drains and 1 strain from around a bed. Previous studies identified
S. marcescens in drains (
48–51), although in some cases there was no link between the isolates and the outbreak under investigation (
48,
49). Drains might have been one of the contamination sources here; 1 drain isolate had a matching PFGE banding pattern and a distance of 0 to 5 hqSNVs in comparison with 2 isolates from the same patient. The other drain isolate was collected from the room of a patient infected with the dominant clone of this outbreak, but unfortunately the isolate was lost prior to typing. It is impossible to know which event came first, i.e., the drain microbiota colonizing the patient or the patient microbiota colonizing the environment.
While allowing for strain typing, the use of WGS in outbreak investigations can provide extended information about outbreak isolates, such as the presence of antibiotic resistance genes and plasmids. In this case, no plasmid was detected but 4 chromosomal genes associated with antibiotic resistance were identified in all strains, 2 of which are known to be linked to aminoglycoside and β-lactam resistance in
S. marcescens (
52,
53). Although determining the presence of antibiotic resistance genes was not clinically relevant and could not be related to phenotypes, due to the absence of antibiotic susceptibility testing data, these results showed the potential of WGS for the investigation of resistant strains of
S. marcescens, as reported for other bacteria (reviewed in reference
54).
Metagenomic sequencing will become essential for outbreak investigations as culture-based methods are increasingly replaced by molecular assays for diagnostic purposes in clinical microbiology laboratories (
55). Here, we used a high-throughput targeted amplicon sequencing approach in order to characterize the bacterial communities and to evaluate the relative abundance of
Serratia in swabs collected from different sampling sites in positive and negative rooms during a NICU
S. marcescens outbreak. In contrast to shotgun metagenomics, in which the total DNA contained in a sample is sequenced, the amplicon metagenomic approach includes a step involving PCR amplification of a phylogenetic marker gene, usually the 16S rRNA gene for bacteria. Although it is less informative than shotgun metagenomics and provides taxonomic information only, marker gene amplicon sequencing might be better suited as a diagnostic tool because it is faster, cost-effective, and applicable to low-biomass samples such as the environmental swabs collected in this study. However, results have to be interpreted with caution since, as reported previously and exemplified here, low-biomass samples are prone to contamination with exogenous DNA from various sources, including reagents and human skin (
56). Although strategies to filter contaminants have been proposed (
57), parsing sequences of contaminants from sequences of microbes truly belonging to the sample remains a challenge that needs to be addressed in order to facilitate the use of amplicon sequencing in clinical settings. Selection of reagents with less contaminating DNA could help to circumvent the problem. Although the
gyrB sequencing protocol allowed the identification of
Serratia marcescens at the species level in environmental samples, further optimization will be required for accurate bacterial community profiling, since the method failed to detect several bacterial genera in a mock community. This finding might be linked to the primers that were used, which were highly degenerate and could fail to amplify certain groups of bacteria.
Whereas culture only allowed the detection of
S. marcescens in 3 environmental samples, metagenomic sequencing of the 16S and
gyrB genes identified sequences related to
Serratia in most of the samples tested, with greater proportions being observed around beds and in ventilator/infusion pumps, indicating that these sites are potential reservoirs for this bacterium. These sampling sites were directly in contact with babies infected or colonized by
S. marcescens, in contrast to the other surfaces tested. A previous NICU study based on 16S rRNA gene amplicon sequencing also reported greater relative abundances of outbreak-related bacteria in samples collected from neonate-associated surfaces, compared with other environmental samples (
17). In this outbreak, 1
S. marcescens isolate was recovered from around 1 bed, and the particular isolate was of the same genotype as 3 isolates from newborns, supporting the metagenomic findings. The presence of
S. marcescens around beds might be linked to the presence of this bacterium in the gastrointestinal tract of newborns, which represents an important reservoir for cross-contamination (
5–7).
The relative abundances of
Serratia were low in all samples collected in the control room, indicating that the disinfection protocol used to eliminate potential sources of contamination following discharge of a case was efficient. Interestingly, the community composition and structure at several sampling sites in the control room were also clearly distinct from those of the positive rooms, illustrating the strong effects of disinfection on bacterial communities, as described previously (
17). Further testing would be required to confirm these observations, since a single disinfected control room was available for this study.
Low relative abundances of
Serratia sequences were detected in the drains of positive rooms, while 2 of 3 environmental isolates were recovered from those sampling sites. This phenomenon illustrates a potential limitation of the use of relative abundances in gene-targeted metagenomic studies; although the relative abundance of
Serratia in drains was low, the total numbers of
Serratia cells might still be higher than on dry surfaces, due to the massive amounts of biomass colonizing drains, which could have facilitated the recovery of
S. marcescens isolates from those sites. This issue could be circumvented in further studies by using quantitative PCR to estimate the bacterial load in each sample (
58) or by spiking exogenous bacteria prior to sample processing (
59).
While providing sensitive and informative data regarding the environmental spread of a causative agent during an outbreak, as exemplified here in the context of a
S. marcescens outbreak, sequencing of bacterial taxonomic marker genes facilitates better overall understanding of bacterial communities in hospitals. Projects such as the Hospital Microbiome Project specifically aim to identify factors that modulate microbial population development in health care environments through extended surface, air, staff, and patient sampling (
15). One trend observed in a recent publication from that project was that surfaces in patients' rooms, especially bedrails, harbored microbial communities resembling those of the skin of the current patient (
60). Although we could not confirm such a trend, as no such samples were collected from patients, bacterial genera previously identified as dominant in the skin microbiome of newborns (
61), such as
Streptococcus and
Staphylococcus, were found at high relative abundances in NICU surface samples. However, these findings have to be interpreted with caution, since DNA from skin bacteria tend to be found everywhere and are often detected as contaminants, as discussed above. Interestingly,
Serratia was detected at low relative abundances in the negative controls and was not identified as a dominant member of bacterial communities on surfaces in previous NICU metagenomic studies (
16,
17), indicating that the high relative abundances observed here at newborn-affected sites might be directly linked to the
S. marcescens outbreak. However, those studies reported the presence of several bacterial genera commonly linked to NICU outbreaks, as was the case in our study.
In conclusion, outbreak investigations of hospital-acquired infections are challenging but necessary in order to control and to limit the burden of this increasing threat. In this study, we described how the use of high-throughput sequencing applications, such as bacterial WGS and targeted amplicon sequencing, can provide new tools that have the potential to contribute to outbreak investigations. Links between S. marcescens isolates and potential antibiotic resistance genes and plasmids were assessed by WGS, while sequencing of bacterial phylogenetic marker genes (16S rRNA and gyrB genes) was used to map the distribution of Serratia strains and to describe bacterial communities in hospital rooms. Although we applied these tools to a specific NICU outbreak, they could easily be used for investigations of outbreaks caused by other bacteria, with the potential to improve our knowledge of bacterial ecology and transmission pathways in the hospital environment, which in turn could support and provide evidence for the development and evaluation of strategies for infection prevention and control.