Alignment of the amino acid sequence of TgCA_RP with those of other α-CA sequences (
Fig. 1A; see also
Table S1 in the supplemental material) and with orthologous sequences from other apicomplexan parasites (
Fig. 1B) illustrates that TgCA_RP is more similar to the
Plasmodium η-CA, PfCA (PF3D7_1140000), than the α-CAs (
Fig. 1). The position of TgCA_RP in a neighbor-joining tree of alpha, beta, eta, and zeta family CAs supports this conclusion (
Fig. S2). TgCA_RP also shares several important features with PfCA (
12). The canonical α-CA zinc coordination domain consists of three histidines identified by their position in the human carbonic anhydrase 2 (HuCA2) sequence (His
94, His
96, and His
119) (
Fig. 1A, *, **, and ***). The third histidine (His
119) of the zinc coordination domain is replaced with a glutamine (Gln) in the η
-class enzyme PfCA (Gln
320,
Fig. 1B, ***) (
11). In the case of TgCA_RP, two of the three histidines of the Zn
2+ coordination domain are substituted. Like PfCA, the position of the third histidine is occupied by a Gln (Gln
254,
Fig. 1A, ***), but in addition, the first His is changed to a phenylalanine (Phe
233, Fig, 1A, *). These two substitutions in an essential domain for Zn
2+ binding indicated that TgCA_RP would be inactive (
13) (
Fig. 1A and
B). We compared the Zn
2+ coordination domain of TgCA_RP with the same domain of other apicomplexan CA orthologs (
Fig. S1).
Neospora caninum (NcCAh),
Eimeria tenella (EtCAh),
Theileria parva (TpCAh), and
Theileria equi (TeCAh) show a replacement of the first His in the Zn
2+ binding domain with Phe, Leu, or Asp (Phe
236, Leu
209, Asp
200, and Asp
196, respectively). The
Plasmodium berghei (PbCAh) and
Plasmodium yoelii (PyCAh) sequences replaced the His in the Zn
2+ coordination domain with asparagine (Asn
328 and Asn
480,
Fig. 1A, *). PfCA is, the only protein of this group that possesses a histidine in this position. These substitutions, specifically with asparagines, do not necessarily preclude activity. Previously, Lesburg et al. (
14) substituted an asparagine for His
94 in HuCA2, and the enzyme retained activity, although activity was markedly reduced (
14). It is also possible that PfCA is the only enzymatically active carbonic anhydrase of this group and this question will be answered only once the other enzymes are characterized. TgCA_RP also shows other features that define the η class. The gateway residues Glu
106 and Thr
199 in HuCA2 are responsible for the orientation of the substrate during catalysis. Both TgCA_RP (Glu
239 and Thr
414) and PfCA (Glu
305 and Thr
500) are predicted to have these gateway residues in the appropriate position for substrate orientation (
14,
15) (
Fig. 1C, blue amino acids). TgCA_RP and PfCA lack the His
64 proton shuttle present in most α-CA active sites, which is responsible for increased catalysis via the transport of protons from catalytic intermediates to the environment (
Fig. 1A and
B, ′′′) (
16–18). This was also the case for the other apicomplexan orthologs that were aligned (
Fig. 1B). An alignment of the I‑TASSER predicted structure of the putative TgCA_RP CA domain (amino acids 121 to 446) (
19–21) with the crystal structure of HuCA2 (
22) illustrates the above features and their location in the predicted active site (
Fig. 1C).
Two prediction systems for glycosylphosphatidylinositol (GPI)-anchored proteins, PredGPI (
23) and GPI‑SOM (
24), predicted that TgCA_RP has a C-terminal GPI anchor cleavage/attachment consensus sequence (
Table S2). We also analyzed the sequences of TgCA_RP orthologs from related apicomplexan parasites. All of the orthologs, except for PfCA of
P. falciparum, were predicted to contain GPI anchor attachment sites, including orthologs of
P. yoelii and
P. berghei (
Table S2) (
23,
24). These predictions suggest that the GPI anchor modification of TgCA_RP and its orthologs may be conserved from a common ancestor of the Apicomplexa.
To investigate the CA activity of TgCA_RP, we expressed two truncated forms, rTgCA_RPa (amino acids [aa] 121 to 445) and rTgCA_RPb (aa 94 to 488), in
Escherichia coli. The truncated proteins were soluble, unlike the full-length protein, which was expressed and purified for antibody generation (see below). rTgCA_RPa contains the predicted CA domain, and rTgCA_RPb includes, in addition, the N-terminal region downstream of a cleavage motif, SXL↓Q (
91SLLQ
94), previously characterized in toxolysin, a rhoptry metalloprotease (
25). Both fragments were cloned in the Pet32Lic/EK vector, and the resulting constructs were transformed into
E. coli for expression. Nickel affinity purification of soluble protein products followed with the aim of measuring rTgCA_RP catalytic activity. Two protocols for activity were used: hydrolysis of
p-nitrophenyl acetate (pNPA) (
26) and in-gel CO
2 hydration assays (
27). Commercially available bovine carbonic anhydrase (Sigma catalog no. C3934) was used as a positive control for activity. We were unable to detect any measurable activity from rTgCA_RPa or rTgCA_RPb when measured at various pHs (pH 6.0 to 8.0) and buffer compositions (Tris-HCl, Tris-SO
4, Tricine-KOH, HEPES-KOH, morpholineethanesulfonic acid [MES], and morpholinepropanesulfonic acid [MOPS]) (data not shown). It was reported that site-directed mutagenesis of the Zn
2+ coordination site restored enzymatic activity of some α-type CARPs (
28,
29). We performed site-directed mutagenesis of rTgCA_RPa and rTgCA_RPb to alter the predicted Zn
2+ coordination residues to the His
94-His
96-His
119 of the α-CAs (as in HuCA2) or to His
299-His
301-Glu
320 as in PfCA, but these alterations did not restore activity to rTgCA_RP as measured with either activity assay (data not shown). In summary, sequence analysis and experimental results indicated that TgCA_RP lacks activity. However, we were unable to restore its activity by mutating specific amino acids of rTgCA_RP, making it difficult to definitively state that TgCA_RP is inactive.