INTRODUCTION
Enterococcus cecorum is a commensal bacterium of the gut microbiota of adult chickens (
1–3). This bacterium has emerged over the last 20 years as a significant cause of locomotor disorders in poultry worldwide, particularly in fast-growing broilers (
4). In France, reports on
E. cecorum between 2006 and 2018 have shown an increase from 0.3% to >7% of total avian infections, the majority of which include locomotor disorders in broilers (
5).
E. cecorum is mostly responsible for osteomyelitis, spondylitis, vertebral osteoarthritis, and femoral head necrosis (
1,
6,
7), causing substantial losses in broiler production due to culling, mortality, condemnations at the slaughterhouse, veterinary costs, and increased exposure to antibiotics (
6,
8,
9). The development of
E. cecorum infections is multifactorial and depends on host genetics, rapid growth, feed composition, husbandry procedures, and animal density in combination with the pathogenic potential of the bacterium (
10–13). The association between the early intestinal carriage of
E. cecorum and an increased risk of infections points to the gastrointestinal tract as a route of infection (
10,
11,
14,
15). Recent studies indicate that lesion-associated
E. cecorum isolates appear to be adapted to colonize the gut early in life, in contrast to nonclinical isolates (i.e., strains isolated from the gut of healthy birds), which do not colonize to a detectable level before week 3 (
10,
14). It was also proposed that disinfection failure may contribute to
E. cecorum persistence and outbreaks due to biofilm formation (
7,
16–18). Although suggested by the prediction of host-binding proteins in the genome (
19), adhesion to host tissue proteins has been overlooked, and the robustness of
E. cecorum biofilms and associated properties remain to be investigated. It is also assumed that in Gram-positive bacteria, the thick layer of peptidoglycan that surrounds the cytoplasmic membrane confers resistance to the bactericidal activity of serum; for instance, human serum selectively kills commensal
Enterococcus faecium strains, whereas disease-associated
E. faecium strains are not susceptible (
20). Assessing the pathogenic potential of
E. cecorum isolates remains a challenge. Recently, an
in vivo model has been used to compare the pathogenicity of two clinical isolates under field conditions, but it is applicable to only a limited number of strains (
3). Less limiting, the chicken embryo lethality assay (CELA) has shown tendencies where pathogenic strains kill more efficiently than commensal isolates (
14,
21).
Several molecular epidemiological studies based on pulsed-field gel electrophoresis (PFGE) patterns of commensal and clinical isolates from the United States, Canada, Belgium, the Netherlands, Germany, and Poland agree that commensal isolates have higher diversity than clinical isolates, suggesting the evolution of specific clones with greater pathogenic potential. However, clinical isolates exhibited multiple PFGE patterns, supporting the hypothesis of the polyclonal nature of the infectious isolates (
22–26). Furthermore, repeated outbreaks with genotypically related isolates within farms and local areas substantiate horizontal transmission and a farm-related reservoir (
7,
17,
24,
26,
27). To date, there are only 1 complete (type strain NCTC12421 [GenBank accession number
NZ_LS483306.1]) and 84 partial nonredundant genomes of
E. cecorum available. Only two comparative genomic studies of
E. cecorum isolates from the United States have been performed (
19,
28). A comparison of the partial genomes of three commensal and three clinical isolates from the southeastern United States isolated between 2010 and 2011 indicated that the pathogenic
E. cecorum strains had smaller genomes with >120 genes absent or whose products had <40% identity in the commensal isolates (
19). On the other hand, ~70 genes of the nonclinical isolates were absent or encoded products with <60% identity in the clinical isolates. In line with studies reporting a high rate of clinical isolates unable to metabolize mannitol (
14,
24,
25), the orthologs of mannitol phosphate dehydrogenase, the mannitol operon activator, as well as mannitol-specific component IIA of the phosphotransferase system (PTS) were not found in clinical isolates (
19). In another study, the partial genomes of nine clinical isolates isolated in Pennsylvania in 2008 and 2009 were compared with those of nine nonclinical isolates from the National Antimicrobial Resistance Monitoring System isolated between 2003 and 2010 (
28). The trend of a slightly smaller genome size for clinical isolates was confirmed and was consistent with a larger accessory genome of nonclinical isolates. Noticeably, the nonclinical genomes had more antibiotic resistance genes. By combining available
E. cecorum draft genome sequences (
29,
30), the core genome was estimated to have 1,436 genes (
28). Phylogenetic analysis of the core genome led the authors of this study to conclude that the isolates cluster independently of their clinical or nonclinical status, which raises the question of whether clinical isolates of
E. cecorum belong to specific genetic groups. The objective of this study was to provide better insight into the genomic organization and phenotypic diversity of
E. cecorum clinical isolates from broilers circulating between 2007 and 2017 in Brittany, the leading French commercial broiler-producing area. We performed whole-genome sequencing of more than 100 poultry and human clinical isolates in order to better define the extent of the genetic relatedness of clinical isolates and detect genes associated with virulence-related traits. We completed this genomic analysis by testing isolates for their adhesion to type II collagen, biofilm robustness, and growth in chicken serum. The overall genetic diversity of
E. cecorum was investigated by pangenome analysis, with a particular focus on mobile genetic elements (MGEs), antimicrobial resistance genes (ARGs), and genome-wide association studies (GWASs) of the accessory genes and the phenotypic traits.
DISCUSSION
E. cecorum has emerged as an opportunistic pathogen in poultry worldwide. The present study shows the clonality of a large collection of clinical E. cecorum isolates collected between 2007 and 2017 in the leading French commercial broiler-producing area. It identifies the main phylogenetic clades and subclades and provides the first insight into the intercontinental clonality of clinical isolates of E. cecorum from poultry. Six genes significantly associated with the origin of the isolates allow the discrimination of 94% of the avian clinical isolates of the collection from the nonclinical ones. Based on the new complete genomes, we also provide insight into the diversity of the mobile genetic elements of E. cecorum that carry ARGs.
One way of assessing the diversity of a species is to analyze its core and accessory genomes (
44–46). Although they should be considered rough estimates due to the use of draft genomes, the current
E. cecorum core genome represents only 14.1% (1,207/8,523 CDSs) of the pangenome and ~50% of the average
E. cecorum genome. While the size of the core genome is within the range of the one reported previously by Sharma et al., the
E. cecorum pangenome is now 35% larger than previous estimates due to the addition of 130 genomes to the 18 genomes used previously (
28). The increase in the genome size corresponds to an average of only 20 genes per genome; however, the low gene discovery rate is probably due to the high proportion (93%) of clinical poultry isolates in clade E. Indeed, clade E isolates (
n = 117) accounted for 22% (1,873/8,523) of the accessory genes, while more distantly related isolates of clades A (
n = 2), B (
n = 5), C (
n = 16), and D (
n = 8) accounted for 34.0% (2,897/8,523) of the accessory genes. The proportions of core genes and accessory genes strongly correlate with the lifestyle of the bacterium. A small core genome compared to a large pangenome reflects the diversity of hosts and lifestyles, as observed for
E. coli sensu stricto and
Salmonella enterica, two ubiquitous species with commensal or pathogenic lifestyles whose core genomes account for 0.39% and 1.9% of the pangenome, respectively (
47,
48). Conversely, a high proportion of core genes reflects more restricted lifestyles, such as those of
Bacillus anthracis (65%), an obligate pathogen, and
Staphylococcus aureus (36%) and
Streptococcus pyogenes (37%), two human-restricted pathogens (
45). Broader sampling from different hosts and countries is needed to further evaluate the diversity of the
E. cecorum species.
Genome reduction associated with pathogenicity is observed in many bacteria, including
Streptococcus suis and
Streptococcus agalactiae, whose genome sizes are reduced in virulent host-adapted isolates (
49–52). Although longer
E. cecorum genomes belonged to clinical poultry isolates, the average genome sizes were similar for all poultry isolates regardless of their clinical status (2.40 ± 0.14 and 2.36 ± 0.11 Mbp for clinical poultry isolates and nonclinical poultry isolates, respectively). This observation differs slightly from those of two previous studies on isolates from the United States, where clinical isolates had shorter genomes than nonclinical isolates (
19,
28). The latter is most likely due to the small number of genomes examined and the sampling of clinical isolates, which, according to our findings, all belong to the same phylogenetic subclade, E12. In the current data set, the smallest genomes correspond to isolates from phylogenetic subclade E11, and the largest genomes correspond to isolates from subclade E4. Interestingly, all but one of the E4 genomes (
n = 27) have no CRISPR-
cas systems, correlating with abundant ICEs, GIs, and prophages.
Previous molecular studies, using mainly PFGE, have converged toward the genetic homogeneity of
E. cecorum clinical isolates from the same country compared to nonclinical isolates (
19,
22–28). Despite more than 6,500 cases of
E. cecorum infections in poultry reported in France since 2007 (
53), the genetic diversity of
E. cecorum in France and the genetic relatedness with isolates from other countries have never been studied. Phylogenetic analysis of 100 isolates spanning the period from 2007 to 2017 showed that a single clade (E) was responsible for almost all (96%) cases in farms between 5 and 90 km apart in Brittany. The phylogenetic congruence of clinical isolates from the United States is consistent with two hypotheses: (i) isolates were issued from transcontinental dissemination or (ii) isolates from different regions suffered comparable selective pressures. The use of a limited number of commercial genetic lines of broiler chickens may have contributed to the selection of
E. cecorum clones with pathogenic potential and allowed transcontinental dissemination. While the hypothesis of the transcontinental spread of clade E isolates through the meat trade is unlikely, trade of live animals (
54), surface-contaminated eggs, or transport by wild birds may well have contributed to this spread. The second hypothesis, although less likely, is parallel convergent selection due to breeding conditions. The isolates in this study may not represent the full diversity of the French clinical population of
E. cecorum; however, their temporal distribution reflects the spread and persistence of a clade particularly adapted to broilers with the emergence over time of some more successful subclades. This is illustrated by the dominance of subclade E4 (82%) in France and subclade E12 (75%) in the United States between 2008 and 2010 in the current data set. Subsequently, the dominant subclades were E6, E13, E11, and E10, the latter two of which were dominant on French farms in 2016. The temporary circulation of specific subclades may be due to natural evolution or adaptive changes in response to modifications of breeding practices (like novel biocides and cleaning procedures or different feed origins, compositions, or additives), but the reasons remain to be determined. Although less frequent, clinical isolates were also found in clades C and D, which contain nonclinical isolates and display higher diversity. Noticeably, isolates of these two clades have multiple ARGs, which may confer a selective advantage and thus may contribute to the pathogenic potential under specific conditions that remain to be determined. Additional genomes of nonclinical isolates but also clinical isolates from diverse countries and other poultry species and husbandry systems are required to obtain a comprehensive view of the
E. cecorum population structure and determine whether
E. cecorum clade E isolates are broiler specific.
E. cecorum has occasionally been involved in human infections (
55–57). Four of the six clinical human isolates in this study were clade E isolates, supporting a poultry origin. However, this does not lead to the conclusion that contamination was foodborne. The two other clinical human isolates belong to clade A and are phylogenetically close. Both have large clusters of highly specific genes, including an iron transporter and an ~60-kbp motility locus characterized by flagellar and chemotaxis genes, which may have been acquired by horizontal gene transfer from other species of enterococci such as
Enterococcus casseliflavus,
E. gallinarum, or
E. columbae encountered in birds but also other animals, including humans, insects, and aquatic hosts for the first two (
58).
The dominance of
E. cecorum clinical isolates from clade E strongly supports the hypothesis that clade E isolates have acquired properties that increase their fitness and/or infectivity. At the core-genome level, all clade E isolates share SNPs that may confer a selective advantage to the host. Of the two nonneutral mutations, one is in the phage shock protein gene
pspC that encodes an ortholog of the transmembrane protein LiaY, probably involved in resistance to cationic antibiotics and antimicrobial peptides (
59). Codon changes leading to synonymous SNPs may also modify the translation efficiency as such to foster cell fitness (
60). In addition to the clade E-specific mutations in the core genes, 83 accessory genes were found to be enriched in the clade E isolates, of which 13 genes are of the capsule operon. The capsule is an important virulence factor to evade host immunity, including for enterococci (see reference
61 for a review on the subject). We identified two closely related capsule loci in 81.2% of the clade E isolates. Similarly, Borst et al. (
19) identified in the three clinical isolates (SA1 to SA3) of their study a capsular polysaccharide locus, which corresponds to that of the CIRMBP-1228 genome found most frequently in the clade E isolates. The strong association between these capsule loci and the clinical isolates suggests a role in virulence, probably by promoting immune evasion (
62). However, virulence, especially for opportunistic pathogens, is a multifactorial process involving multiple bacterial traits such as metabolic functions and stress resistance (
63,
64). Other clade E-enriched genes may give
E. cecorum alternative metabolic capacities to survive and/or multiply in the host. The predicted galactitol phosphotransferase system (CIRMBP1228_02729 to CIRMBP1228_02731), galactonate catabolism enzymes (CIRMBP1228_02728 and CIRMBP1228_02732), as well as the biotin biosynthesis genes may give
E. cecorum an advantage in competing with commensal species in nutrient-limited environments such as the gastrointestinal tract, considered a portal of entry during the first week of life of the host (
10). Of note, the mannitol-1-phosphate 5-dehydrogenase (
mtlD) gene proposed to be specific for nonclinical isolates (
19) was not discriminatory between nonclinical and clinical poultry isolates in our collection because it was present in only seven isolates of different origins. Conversely, we selected 6 genes, which, taken together, allow the discrimination of more than 90% of the clinical isolates. The combined detection of these candidate genes in a larger collection of clinical and nonclinical isolates is necessary in order to evaluate their use for the detection of the early carriage of potentially pathogenic isolates, particularly during the first week of life. In contrast, while ascorbate catabolism genes are dispensable in 70% of the clinical avian isolates, they may confer a competitive advantage to avian nonclinical isolates and human clinical isolates.
MGEs, including prophages, are major contributors to the evolution of the gene repertoire. We identified 75 MGEs corresponding to predicted ICEs or related elements and GIs in the completely sequenced genomes and a total of 105 complete prophages in the 118 sequenced genomes. Three GIs have a composite structure with phage genes and more than one integrase gene, likely resulting from the independent integration of different MGEs (
38,
65).
E. cecorum MGEs are integrated into the 3′ ends or the intergenic regions of genes encoding a ribosomal protein (
rpsB,
rpsI,
rpsF, and
rpmE) and a few tRNA genes or riboswitches (
tRNA-Thr, transfer-messenger RNA [
tmRNA], and
preQ1) but also various other intergenic regions. This is consistent with the different site specificities of the prevalent integrases of the tyrosine and DDE recombinase families (
36,
38,
39), although there is relatively little integration into the tRNA genes that are frequently targeted by tyrosine integrases in streptococci (
66). The apparently nonrandom distribution of the
E. cecorum MGEs integrated relatively far away from the chromosomal origin of replication may be related to the eviction of highly expressed genes located near the origin of replication (
67,
68). Among the prophages that we identified, prophages homologous to PHAGE_Strept_5093, PHAGE_Entero_EFC_1, and PHAGE_Bacill_phBC6A52 were also detected in clinical and nonclinical poultry isolates from the United States (
28). In contrast, and with the exception of Tn
917, the ICEs and GIs identified in the French isolates are not well conserved in the U.S. isolates, indicating that they were acquired separately and contributed to local adaptation. In addition to the type IV secretion system involved in the formation of the DNA translocation channel, ICEs encode cell surface adhesins for attachment to the target cell. These include LPxTG cell wall-anchored adhesins such as
S. agalactiae antigen I/II family adhesins, also referred to as group B
Streptococcus surface proteins (Bsps) (
69,
70). These structural proteins were also shown to promote biofilm formation, interactions with host cells, and virulence (
71,
72). We identified Bsp-like proteins in Tn
GBS1- and Tn
GBS2-related ICEs, proteins containing CnaB domains initially found in microbial surface components recognizing adhesive matrix molecules (MSCRAMMs), and VaFE repeat-containing surface-anchored proteins in diverse
E. cecorum ICEs and GIs, yet none of these were associated with a specific trait or origin. However, the variability of these adhesins may hinder any association, as they may also compensate for each other. ICEs and GIs may carry diverse genes, known as cargo genes, that are not involved in gene transfer but may confer a selective advantage to the host strain. We have identified several
E. cecorum cargo genes encoding toxin-antitoxin modules that function as MGE addiction systems but are also involved in the control of bacterial growth (
73). Other cargo genes encode restriction-modification systems that protect the cell against horizontal gene transfer or are genes involved in protection against oxidative stress or resistance to cadmium or arsenate, which could confer better fitness contributing to ecological adaptation. An accessory SecA2-SecY2 operon was also identified in strain CIRMBP-1228. Such systems are dedicated to the export of glycosylated serine-rich repeat proteins (SRRPs) that participate in adhesion to host cells and/or biofilm formation (
74,
75). Functional analysis of a few selected strains is required to evaluate whether and how these accessory genes contribute to adaptation to environmental challenges.
ARGs are other clinically important cargo genes spread by ICEs and GIs (
76). The ARGs identified in
E. cecorum confer resistance to tetracycline, macrolides, bacitracin, aminoglycosides, and, much more rarely, glycopeptides. The most prevalent ARGs are the
tet(M),
tet(L), and
erm(B) genes. This is in line with the high prevalence of resistance to tetracycline and erythromycin in
E. cecorum found in various studies, as reviewed by Jung et al. (
12), and the use of tetracyclines and macrolides despite substantial efforts to reduce their use in veterinary medicine. As anticipated from the literature,
tet(M) is carried on ICEs of the Tn
916 family, and
erm(B) is carried on Tn
917 (
77). Other macrolide resistance genes such as
mef(A),
msr(D), and
lnu(B) and aminoglycoside resistance genes such as
ant(6)-Ia and
aph(3′)-III are prevalent in U.S. isolates (
28). The
mef(A),
msr(D),
vat, and
erm(C) genes are located on the same GI in CIRMBP-1246 (CIRMBP1246_01012 to CIRMBP1246_01050),
lnu(C) is adjacent to the IS
1595 family transposase IS
Sag10, and the two adjacent genes
lnu(B) and
lsa(E) are next to the IS
1595 family transposase IS
Cpe8, previously described in an avian
Clostridium perfringens strain carrying the lincomycin resistance gene
lnu(P) on the plasmidic transposable element tIS
Cpe8 (
78). Another prevalent ARG is the bacitracin resistance operon
bcr in isolates of clades C and D and subclades E10, E11, and E12. This operon is frequently associated with the
tet(M) and
tet(L) genes on ICEs of the Tn
916 family. The highly conserved nucleotide sequence of the
bcr operon, including the flanking element IS
Enfa1, and its location on Tn
916-like elements or GIs are consistent with avian interspecies transmission involving
E. faecalis,
E. faecium, and
C. perfringens (
79,
80). Note that the carriage of
tet(M) and
tet(L) on the same Tn
916-like element is uncommon. It was first described in
Streptococcus gallolyticus and was proposed to benefit the host bacterium under stressful conditions (
81,
82). A single clinical poultry isolate from France has the
vanA operon, a gene previously described in an
E. cecorum strain from retail poultry in Japan (
83). Overall, 26% (
n = 39) of the isolates carry multiple ARGs (4 to 10) conferring resistance to at least three antimicrobial families and are prevalent in clinical and nonclinical isolates of clades C and D and subclades E11 and E12. The very few strains without predicted ARGs and the differential ARG profiles between French and U.S. isolates probably reflect strong antibiotic selective pressure that differs between the two countries. Indeed, in-feed bacitracin and in-feed macrolides are still used in poultry farming in the United States (
84). The successive European bans of antibiotics (avoparcin in 1997; bacitracin, spiramycin, tylosin, and virginiamycin in 1999; and avilamycin and flavophospholipol in 2006) and French national EcoAntibio plans (
85,
86) launched in 2012 and 2017 to fight antimicrobial resistance in animal health and promote the responsible use of antibiotics might have contributed to containing the spread of ARGs and reducing MLS resistance genes, as observed in recent isolates of subclade E10. In fact, this is in line with the decreasing trend of macrolide resistance of
E. cecorum strains isolated from French poultry according to the French Surveillance Network for Antimicrobial Resistance in Bacteria from Diseased Animals (RESAPATH) (
https://shiny-public.anses.fr/resapath2/). However, there has been a marked increase in bacitracin resistance genes in French isolates since 2015, even though bacitracin is not used in avian veterinary medicine in France (
87). Bacitracin is produced by
Bacillus licheniformis and
Bacillus subtilis strains. With the need for alternatives to antibiotics in livestock, bacillus strains are used as probiotics or applied together with lactic acid bacteria as a protective biofilm against pathogens (
88–90). The increase in
E. cecorum isolates carrying the
bcr operon points to the need to examine whether
Bacillus strains applied on farms produce bacitracin or related antimicrobial compounds that could contribute to the dissemination of the
bcr operon. Reassuringly, relatively few aminoglycoside and vancomycin resistance genes that target gentamicin and glycopeptides, two critically important antimicrobials in human medicine, are found in French isolates, as elsewhere in Europe (
12).
Overall, the results of this study shed light on the population of E. cecorum clinical isolates in France and reveal a genetic linkage with E. cecorum clinical isolates from elsewhere. We have shown that based on the available data, the majority of clinical poultry isolates are phylogenetically distinct from nonclinical poultry isolates and form a main clade responsible for the outbreaks of E. cecorum in France and probably the United States and Europe. ICEs and GIs are the main carriers of antimicrobial resistance. The E clade of E. cecorum appears to have adapted to the conditions under which poultry are reared, highlighting its importance as an emerging threat to the poultry industry worldwide. This information can be used to design and guide preventive strategies to reduce the impact of E. cecorum clade E isolates.