QC deletion mutants are non-viable
In agreement with transposon mutagenesis studies deeming PgQC essential (
8), we were unable to create a viable PgQC deletion mutant (
4). However, close inspection of the genomic locus revealed a complex arrangement of three open reading frames (
PG_2157,
PG_2158, and
PG_2159), indicating that the PgQC gene (
PG_2157) is the first of a three-gene operon (
Fig. 1A). The accompanying genes are not well characterized but could be sensitive to polar effects when PgQC is mutated.
PG_2158 encodes a protein with an Fe-S center whose function is unknown, whereas
PG_2159 (
hemG) encodes a protoporphyrin oxygenase involved in iron acquisition (
18). To determine whether these genes are essential, we constructed two additional deletion mutants (Δ
PG_2158 and Δ
PG_2159) and the control strains QC+ and PG_2159+ (
Fig. 1A). In all these strains, the colony morphology and growth rate were indistinguishable from the wild-type strain, and gingipain activity was normal, with some increase for soluble Kgp activity (
Fig. 1B and C; Fig. S1A and B). These results show that
PG_2158 and
PG_2159 are not essential for
P. gingivalis at least
in vitro, supporting our hypothesis that the pyroglutamyl formation of secreted proteins is essential for
P. gingivalis growth.
Plasmid stability assay confirms that QC is essential
To provide more evidence that PgQC is essential, we used a system in which gene significance can be inferred from the stability of a plasmid carrying a candidate essential gene in the absence of antibiotic selection pressure. For this purpose, we engineered a QC
m strain (m = merodiploid, reflecting the presence of two copies of the
PgQC gene, one in the genome and the other on the plasmid). Next, we replaced the genomic
PgQC coding sequence with the
ermF cassette, resulting in a QC
p strain (p = plasmid, now the only copy of
PgQC). Initial characterization of the QC
m and QC
p strains showed that the production of PgQC protein from the plasmid was enhanced compared to the control containing the empty plasmid (pTIO2-tet) (
Fig. 1D). We did also observe some increase in gingipain activity (Fig. S1D and E), and as expected, there was no impairment in pigmentation phenotype (
Fig. 1E), which is dependent on gingipain activity. The QC
p strain grew more slowly, and the generation time (
g) was longer than the other strains (wild type,
g = 2.42 h; pTIO2-tet,
g = 3.19 h; QC
m,
g = 2.76 h; and QC
p,
g = 4.83 h). The stationary phase biomass of the QC
p strain was ~40% lower than that of the QC
m strain (
Fig. 1F).
In the plasmid stability experiment, the QC
m and QC
p strains were maintained for eight generations in the absence of antibiotics. The number of cells carrying the expTIO_QC_tet plasmid was determined on days 0 and 8 by comparing the number of colonies on tetracycline vs control plates without antibiotics (
Fig. 1G). We found that the QC
m strain was prone to curing, resulting in a lower number of antibiotic-resistant colonies on day 8 compared to day 0. In contrast, the QC
p strain maintained the plasmid even in the absence of selection, and the number of antibiotic-resistant colonies was similar on day 8 compared to day 0 (
Fig. 1H). Taken together, these data confirm that PgQC expression is indispensable for
P. gingivalis viability.
PgQC is essential solely due to its enzymatic activity
PgQC may be essential to
P. gingivalis because its enzymatic activity is indispensable, or it may fulfill a non-catalytic role such as scaffolding or interacting with other proteins. Indeed, the spatial and functional coexistence of PgQC and the Sec translocon in the IM suggests that protein–protein interactions may contribute to the stabilization and/or function of Sec components (
4). We therefore attempted to generate a
P. gingivalis strain expressing catalytically inactive PgQC (mutation D126A). However, these attempts were unsuccessful, suggesting that PgQC enzymatic activity is necessary for
P. gingivalis growth. This does not exclude the possibility that PgQC protein–protein interactions are also important, so we tested a possible dominant negative effect with an overexpression of the inactive enzyme (QC
D126A) in wild-type
P. gingivalis to see if it would inhibit growth. We introduced the plasmid overexpressing PgQC
D126A (expTIO-QC-tet-D126A) and found that the resulting QC
mD126A (mD126A = merodiploid overexpressing the inactive enzyme) mutant strain was fully viable. Like the QC
m strain, also, the QC
mD126 strain produced much larger amounts of PgQC protein than the control strain with the empty plasmid (pTIO2-tet) (
Fig. 2A), but, nevertheless, its enzymatic activity was similar to that of the control (
Fig. 2B). Notably, the QC
mD126A mutant strain showed no difference in pigmentation phenotype (
Fig. 2C), growth kinetics (
Fig. 2D), or gingipain enzymatic activity (
Fig. 2E and F) compared to the control strain. Moreover, proteomic analysis of pyroglutamyl formation revealed no significant differences between the wild-type and QC
mD126A strains (Table S4). Collectively, these results confirm that the importance of PgQC for
P. gingivalis depends solely on the pyroglutamyl formation of proteins translocated across the IM by the Sec system.
Although these results suggest that physical interactions with Sec components are not an essential function of PgQC, its localization on the periplasmic side of the IM may be necessary for the modification of emerging N-terminal Gln residues as proteins are exported into the periplasm. Therefore, we replaced the Cys residue in the lipobox with Gln to produce mutant strain PgQCC20Q. This was phenotypically indistinguishable from the parental strain (Fig. S2A through C), and the QC activity was also at the same level (Fig. S2D). However, the protein was distributed between the IM and periplasm (Fig. S2E) in the form of full-length QC and a truncated version, probably with a cleaved signal peptide. This makes it impossible to determine whether the association between QC and the IM is important for its function.
Heterologous expression of QC enzymes from other bacteroidetes species
Given the large proportion of
P. gingivalis secreted proteins that follow the Q-rule, we investigated the essential function of PgQC in more detail by replacing it with orthologs from other Bacteroidetes species with different proportions of Q-rule proteins:
Porphyromonas macacae (PmQC),
Porphyromonas somerae (PsQC),
T. forsythia (TfQC),
Prevotella intermedia (PiQC),
Barnesiella intestinihominis (BiQC), and
Pedobacter ginsenosidimutan (PedgQC). In addition to these animal-type QCs, we also replaced PgQC with the plant-type QCs expressed by
Nonlabens sediminis (NsQC) and
Alistipes indistinctus (AiQC). This allowed us to test QCs with a high level of sequence and structural divergence (
Fig. 3A; Fig. S3A). The mutant strains were constructed by in-phase substitution of the PgQC coding sequence downstream of the lipobox with the coding sequence of heterologous QCs (without the signal peptide, but including a C-terminal Strep-tag).
Most of the substitutions yielded viable
P. gingivalis mutant strains with colony morphology, generation time, stationary phase biomass, and gingipain activity indistinguishable from the parental strain (
Fig. 3B and C; Fig. S3B and C). However, mutant strains were not recovered for AiQC or BiQC. The presence of PsQC
strep, PiQC
strep, and PedgQC
strep could not be detected on western blots probed with anti-Strep or anti-QC antibodies (
Fig. 3D), but all mutant strains produced measurable QC activity that differed significantly from strain to strain, with the lowest values in the strains expressing NsQC and PsQC (
Fig. 3E). Remarkably, although the QC activity was 44-fold lower in NsQC and 22-fold lower in PsQC compared to wild-type
P. gingivalis, this was still sufficient for the pyroglutamyl formation of Sec-exported proteins at a level that supported normal bacterial growth (Table S4). Although the QC activity of
P. gingivalis mutant strains expressing TfQC and PiQC correspond well to the activity of native
T. forsythia and
P. intermedia strains (
19), we cannot exclude the possibility that NsQC and PsQC expressed by
P. gingivalis are fully active on physiological substrates but exert a minute activity against the reference synthetic substrate.
The subcellular location of heterologous QCs was determined by western blot, revealing that most were anchored into the IM, with only NsQC
strep found exclusively in the OM (
Fig. 4F). Interestingly, the IM retention signal of
E. coli lipoproteins is defined by Asp (+2 residue) following lipidated Cys (+1 residue), whereas glycine (Gly +2 or Gly +3) may fulfill the same function in
P. gingivalis (
20). Our experimental data provide supporting evidence because only NsQC was anchored into the OM and only this enzyme lacks a Gly residue in the Cys(+1)-Lys(+2)-Thr(+3) motif (Fig. S3A). It is remarkable that
P. gingivalis, which is clearly dependent on PgQC, can survive despite the very low activity of heterologous enzymes, regardless of their type and subcellular location.
The role of N-terminal pGlu formation
The unequivocally verified dependence of
P. gingivalis on PgQC activity suggests that (i) there are one or more essential secreted proteins that require N-terminal modification for their stability or activity or (ii) the overall lack of pyroglutamyl formation may exert a cumulative detrimental effect. We, therefore, inspected all proteins with a signal peptide in the new reference genome NZ_CP025932.1. Where possible, we assigned a cellular localization, protein structure, molecular function, and biological process based on published research, databases (UniProt, KEGG, and AlphaFold), and BLAST homology searches. Among 210 secreted proteins, 164 (77.14%) appeared to follow the Q-rule and are, therefore, potential substrates of PgQC (
Table 1; for details, see Table S5). These included 41 (20%) annotated as “conserved hypothetical proteins,” indicating that their function cannot be predicted because, at most, they are only distantly related to characterized proteins.
The very high prevalence of Gln following the signal peptide (Q-value) occurs among proteins predicted to be functionally and structurally associated with the OM (88.4%), T9SS components and cargo proteins (83.3%), remnants of the type 4 secretion system (T4SS) (100%), periplasmic chaperones (85.7%), and, surprisingly, the products of genes that were deemed essential by transposon mutagenesis (84.6%) (
8). The Q-value of proteins involved in peptide catabolism, peptidoglycan biosynthesis, and cell response/regulation was 61.5–75%, but it was much lower for proteins involved in carbohydrate catabolism (27.3%) and lipopolysaccharide synthesis (25.0%). These potential PgQC substrates were found in all extracytoplasmic compartments. We also looked for conserved secondary structures at the N-terminus using Alpha-fold, but few crystallographic structures are available. The analysis revealed a tendency for β-sheet proteins, but this structure was predicted unambiguously only for a subgroup of OM β-barrel proteins.
We, therefore, tested the effect of replacing the key Gln with Ala or Asn in four of the OM β-barrel proteins (PorG, PorQ, PorT, and PorV) and two predicted periplasmic proteins (PG_0320 and PG_1788). Homologous
P. gingivalis strains expressing each mutated protein were generated, and protein levels were determined by probing western blots with antibodies specific for the target proteins (PorQ and PorV) or antibodies specific for the C-terminal His
6 tag (all other proteins). In all cases, the mutated protein accumulated to a significantly lower level than the normal protein in the control strain (
Fig. 4A through K, left panels). This deficit was not due to transcriptional repression because, generally, the mutants showed elevated gene expression compared to the control (
Fig. 4, right panels). The mutants were generally indistinguishable from the parental strains in terms of growth kinetics, pigmentation phenotype, and gingipain activity/distribution (Fig. S4 and S5). However, the
porT Q30N,his strain was an exception, with slower growth than the control strains (Fig. S4C). Cumulatively, these results argue that pyroglutamyl formation is important for the stability of proteins exported to the periplasm and OM, apparently protecting them from proteolytic degradation and truncation, clearly visible in the case of PG_1788
Q21N,his (
Fig. 4F). This was confirmed by the analysis of gingipains, which in the absence of a functional T9SS accumulate in the periplasm as multidomain single-chain proenzymes at a significantly lower level compared to control if the N-terminal Gln is replaced with Asn (
Fig. 4G through I).
None of the mutant strains discussed above showed any loss of viability, but the target genes are known to be nonessential for
in vitro growth on complex medium. We, therefore, repeated the same approach, but this time we mutated two genes (
PG_0076 and
PG_0449) that were previously shown to be essential by transposon mutagenesis (
8). In both mutant strains, the N-terminal Gln was replaced with Asn. Both mutant strains were viable and were indistinguishable from the parental strains in terms of growth, pigmentation, and gingipain activity (Fig. S4J, K, S5J and K). This was the case, despite the much lower level of PG_0076 protein and the truncation of PG_0449 (
Fig. 4J and K).
In order to determine the impact of pyroglutamyl formation on extracytoplasmic proteins of
P. gingivalis, we purified the modified PG_0449 protein from PG_0449
his and PG_0449
Q22N,his strains and subjected to N-terminal sequencing. The results revealed 13 amino acid truncations of the N-terminus of PG_0449 in the latter strain indicating N-terminal proteolytic degradation. Nevertheless, for other substrates, we cannot exclude a direct structure/function stabilizing effect of pyroglutame formation. Generally, the mutated genes were expressed at a higher level than their wild-type counterparts, but the increase in mRNA abundance was inversely correlated with the amount of protein (
Fig. 4B, D through H and K). Although a lower rate of translation or a difference in mRNA stability cannot be excluded, this result strongly suggests the presence of a feedback loop that enhanced transcription but was unable to compensate for the degradation of the modified protein. This seems to be a common phenomenon when Gln is replaced, resulting in low levels of mutated proteins compared to the parental strains (
Fig. 4L).