INTRODUCTION
Enterococcus faecalis is a Gram-positive commensal bacterium of the mammalian gastrointestinal (GI) tract (
1) and a leading cause of nosocomial infections worldwide (
2). In humans, the normal abundance of
E. faecalis in the intestinal lumen ranges between 10
5 and 10
8 CFU/g of feces without causing any obvious deleterious effects on the host (
3,
4). However, if either perturbations of the host/commensal balance that weaken the host immune system or environmental factors such as use of antibiotics that inadvertently facilitate outgrowth of resistant
E. faecalis occur, life-threatening infections might arise. Moreover, the intrinsic robustness enables
E. faecalis to withstand multiple stresses (
5,
6) and provides a fitness advantage with respect to host adaptation and colonization in environments such as the hospital setting (
7–9). The aptitude of this bacterium for acquisition and transfer of mobile genetic elements (plasmids, transposons, and prophages) has facilitated the spread of virulence traits and antibiotic resistance genes among isolates (
10–14).
Notwithstanding several studies undertaken in the last few decades having identified putative virulence determinants that may augment the ability of
E. faecalis to cause disease (reviewed in reference
15), the incidence of many pathogenicity factors has been reported to be independent from the
E. faecalis isolation source. The availability of 5 complete and more than 300 draft
E. faecalis genome sequences (
http://www.ncbi.nlm.nih.gov/genome/808) has provided the opportunity to examine the genomic diversity among strains and identify specific traits that may contribute to virulence (
16–18). However, none of these experimental investigations have performed comparative estimations of the abilities of different
E. faecalis isolates to cause infection employing a live model system with the purpose of correlating
E. faecalis pathogenicity to the whole-genome content.
Several studies have established the soil nematode
Caenorhabditis elegans as a suitable surrogate animal model for
in vivo study of host-microbe interactions with
E. faecalis. Through comparison of the levels of longevity of nematodes infected with isogenic deletion mutants versus parental strains and screening of transposon insertion mutant libraries, several gene categories required for enterococcal pathogenicity in
C. elegans have been identified, including virulence factors such as cytolysin (Cyl) (
19) and gelatinase (Gel) and serine protease (
20) and factors playing a role in cell metabolism and physiology (
21,
22). Notably, these reports have demonstrated that several virulence factors required for infections of
C. elegans and mammals coincide (
21,
23).
In the present study, we investigated the pathogenicity potential of a collection of 28 E. faecalis strains in the C. elegans infection model and the correlation between the virulence phenotype and their whole-genome content. Consistent with previous reports, we found that cytolysin (Cyl) and gelatinase (Gel) are the major factors involved in C. elegans killing by E. faecalis. Further, using a comparative pangenomic analysis combined with statistical regression model testing, a virulence reference map that extends beyond the production of cytolysin and gelatinase was established. This allowed us to identify an important role of a phage03-like element in pathogenicity among nosocomial isolates.
DISCUSSION
The emergence of
E. faecalis strains as prominent nosocomial pathogens and the occurrence of enterococcal infections not treatable with most available antibiotics emphasize the need for understanding the complex mechanism by which this organism causes life-threatening diseases (
49). The availability of large amounts of whole-genome sequencing data has facilitated
in silico comparative analyses of the gene content of enterococcal strains representing a wide spectrum of biotypes, clonal lineages, and isolation sources. As result, these investigations have contributed to the determination of the size and composition of the
E. faecalis core genome (
40,
50), the genetic background for antibiotic resistance (
51), and the identification of potential virulence factors that can influence the bacterial lifestyle (
16). However, there is still a lack of understanding of how the genetic variation between commensal and nosocomial strains could explain the alleged behavior associated with these.
In the present study, we combined statistical and comparative genomic approaches, followed by a functional genomic study, with a model host infection system to examine the correlation between
E. faecalis whole-genome content and its impact in the host-microbe interaction. We showed that infection of
C. elegans by 28
E. faecalis isolates resulted in a wide range of virulence phenotypes, with six strains behaving in a commensal manner, with no nematocidal effect, and 22 strains displaying a virulence phenotype of from high to low levels of pathogenicity. Principal component analysis showed that the first and second principal components account for 98% of the variance in the virulence data, thus indicating a significant relationship between
E. faecalis pathogenicity toward
C. elegans and gene content. We therefore employed a comparative approach to analyze the genomes of the 28
E. faecalis strains to identify variations in the genetic makeup that could explain commensalism or opportunistic lifestyle in the nematode. A tree based on Present/Absent gene patterns identified by the comparative genomic analysis resolved the 28 strains into three lineages and showed a significant correlation between Present/Absent profiles and the phenotype of
E. faecalis virulence toward
C. elegans (
Fig. 4). Moreover, the tree highlighted a deep division between isolates belonging to CC2 and the other strains employed in the study.
Statistical ANOVA established a strong contribution of cytolysin and gelatinase to nematode killing which accounted for 40.8% and 36.5%, respectively, of the observed variations in survival of the nematodes. However, the presence of the two factors explained only 50.6% of the variation. Speculatively, it is conceivable that the lack of synergism between gelatinase and cytolysin is caused by saturation or antagonistic interactions between the two factors when coproduced by a strain.
In agreement with previous studies, evaluation by ANOVA of the
E. faecalis isolation source for the association with increased virulence toward
C. elegans indicated that the isolate origin had no impact on virulence in the nematodes (
52). The fact that non-clinically isolated strains also had the potential to be infectious (
Fig. 3) is consistent with the ambivalent nature of
E. faecalis in relation to its host, with the ability of apparently harmless strains to disrupt the commensal relationship with the host and cause severe diseases.
In addition, in line with a previous investigation that reported a low correlation between the presence of capsule among
E. faecalis clinical isolates and the ability to establish infection (
53), our data showed that the distributions of cell wall polysaccharides are equal at all levels of
E. faecalis virulence, and no significant difference in
C. elegans pathogenicity was observed.
Beside other reports that have clearly demonstrated that gelatinase and cytolysin are two prominent
E. faecalis virulence factors exerting a crucial role in nematode death (
19,
20), our study also identified a strong positive impact of these two factors on nematode killing. The mathematical model, computed to include the
a priori knowledge of the cytolysin and gelatinase genotype and to fit to the virulence at 72 h, established a virulence reference map of statistically significant traits that can be predictive of
E. faecalis virulence (see Table S5 in the supplemental material). Among the resulting patterns, a predominant (41%) fraction is comprised of MGEs. The presence of a multiplicity of mobile elements has been previously described for
E. faecalis V583 and is considered a major driving force of genome evolution (
42,
44,
50). In addition, a study from our group reported the enrichment of structures of MGEs among strains belonging to the high-risk nosocomial clonal complex 2 (CC2) group (
17). The prevalence of MGEs among factors contributing to
E. faecalis pathogenicity supports the notion that highly virulent isolates benefit from the introduction of fitness and virulence factors that confer increased adaption under conditions of stress and infection.
Patterns 1408 and 1523, including CDS EF1827 to EF1830 and EF1833, emerged in our work as contributing to virulence in nematodes. The associated genes reside in a 23.9-kb region corresponding to the 5′-end fragment of
fsrC (
ef1820) and the 3′-end segment of
ef1841 of
E. faecalis V583 whose deletion has been reported to cause the lack of a positive gelatinase phenotype among strains carrying the
gelE gene (
54). Interestingly, the genes encode sugar transport system components related to the
glpK pathway and are involved in glycerol catabolism in
E. faecalis (
55). As described in other bacterial species, growth in the presence of glycerol as the carbon source leads to the activation of a transcription activator also modulating the expression of virulence genes (
56).
The list of candidate genes that might contribute to enterococcal pathogenicity includes variable cell surface structure components, such as EF2164 (pattern 666), EF0967 (pattern 609), and EF3250 and EF3251 (pattern 1577), previously identified as enriched among isolates of the high-risk clonal complex 2 (
17). Surface-exposed proteins play an important role in pathogenicity, as they are often involved in colonization of host tissues and evasion of host defenses (
57,
58). Studies in
C. elegans have ruled out the role of a number of surface structures, such as the AS aggregation substance, the Esp enterococcal surface protein, the Ace adhesin to collagen, and biofilm-associated pilus protein A, in mediating nematode killing (
21,
59), suggesting that their host target may be absent. However, Creti and colleagues demonstrated that deletion of the
ef3314 gene, encoding a surface protein structurally similar to Esp, affects the pathogenicity potential of the resulting
E. faecalis mutant in
C. elegans, thus demonstrating that some surface factors contribute to pathogenesis in the worm (
22). Further functional studies are therefore required to elucidate the significance of the identified traits in enterococcal virulence.
Among the non-V583 annotated loci, pattern 1275 included two proteins holding a CaaX amino terminus and a DJ-1/PfpI domain, respectively. These domains are common to proteases; therefore, a potential role of these genes in virulence is plausible (
60).
Polylysogeny is a common feature in the
E. faecalis genome and is considered to influence the relationship of
E. faecalis with its host, as these elements may confer properties advantageous for the dissemination of the bacterial strain (
17,
41). Duerkop and colleagues demonstrated that temperate phages of
E. faecalis ST6 can modulate the assemblies of bacterial communities in the mammalian intestinal tract (
61). A more recent study showed that five (phage01, phage03, phage04, phage05, and phage07) of the seven V583 prophages are capable of excision from the bacterial chromosome and that four of them produced infective virions and are therefore possible mediators of horizontal gene transfer (
44). Bacteriophage-encoded proteins such as platelet binding proteins were established as factors that provide mechanisms to invade host tissues and damage host cells, enriching the plethora of
E. faecalis virulence traits (
43,
44).
Our comparative genomic approach pointed toward a significant enrichment of a phage03-like element in nosocomial and clinical isolates. We demonstrated that phage03 deletion resulted in a decline of
E. faecalis JH2-2 infectivity both in
C. elegans and in
G. mellonella larvae. In agreement with previous reports that have described the absence of phage03 in the genome of
E. faecalis food isolates (
45) and its enrichment among clonal lineages associated with nosocomial diseases (
17,
40), we detected the presence of phage03 in the genome of CC2 strains
E. faecalis V583, MMH594, and HH22. BLAST searches against the available
E. faecalis genomes showed that this phage03-like element was also found with a higher incidence in nosocomial isolates whereas it is rarely found in other strains of different origins (see Table S8 in the supplemental material). We therefore examined the role of the phage in the pathogenicity of nosocomial isolates by assessing the virulence of an isogenic phage03 mutant of
E. faecalis V583. The mutant displayed attenuation in nematode killing as well as in the
G. mellonella model compared to the parent strain, providing additional evidence that phage03 plays a role in
E. faecalis virulence. To our knowledge, this is the first study to have demonstrated that a phage element endows clinical and nosocomial isolates with factors promoting increased pathogenicity
in vivo during infection in a model organism.
Though the mechanism by which phage03 impacts
E. faecalis virulence remains to be elucidated, the ability of this element to form infective virions indicates that its genes are expressed and that it can act as an efficient vehicle for horizontal gene transfer, spreading traits contributing to boosting strain fitness (
44). Excision of the phage has been associated with treatment with fluoroquinolones (
44), supporting the idea that application of antibiotics in the nosocomial environment may facilitate the creation of conditions favorable for the emergence of highly pathogenic clonal lineages.
Analysis of the phage03 sequence revealed the presence of proteins that could play a role in bacterial virulence.
ef1420 encodes a protein of unknown function previously reported to significantly decrease
E. faecalis infectivity in the
G. mellonella model (
62). This protein was predicted to contain a lipobox motif and is expected to be a surface-exposed element (
45,
63). Lipoproteins represent approximately 25% of the proteins predicted to be associated with
E. faecalis cell envelope, and they are reported to be involved in substrate binding and in delivery to ABC transporters, acquisition of sugars, protein folding, antibiotic resistance, and cell envelope stability (
64,
65). In addition, lipoproteins play a significant role in virulence, as they mediate adhesion to the extracellular matrix (ECM), initiation of the inflammatory process followed by activation of the immune system, induction of phagosome escape, and translocation of virulence factors (
63).
Besides EF1420, two proteins (EF1460 and EF1474) contain a LysM domain in the C terminus thought to contribute to the anchoring to peptidoglycan (
66) whereas EF1426 is a VirE domain-containing protein conceivably associated with virulence by exerting a role in the type IV secretion pathway in other organisms (
67). It is noteworthy that many phage03 proteins have an unknown function but nevertheless might play a role in and contribute to
E. faecalis pathogenicity.
In conclusion, using a statistical and comparative genomic approach combined with an in vivo assessment of E. faecalis virulence during infection in C. elegans, our results establish a set of nonorthologous gene patterns that may be predictive of E. faecalis pathogenicity. Our report highlights the importance of MGEs as potential contributors to enterococcal pathogenicity and demonstrates that phage03, a prevalent element among nosocomial clonal lineages, is responsible for enhanced virulence. Future functional investigations are required to validate any involvement of the identified patterns in E. faecalis virulence and to gain a deeper understanding of the genetic mechanism underlying enterococcal pathogenesis.