As the original MTase construct (aa 4–278) did not crystallize, we generated another that includes the complete N terminus but is devoid of the linker between the MTase and RdRp domains (aa 1–264; PDB code
5M5B ). This construct allowed determination of the crystal structure of the ZIKV MTase domain. The structure was solved by molecular replacement using the DENV3 NS5 MTase/SAM (PDB code
3P97 ) as the template model and refined to a resolution of 2.01 Å, with
Rwork and
Rfree values of 16.0% and 19.1%, respectively. The crystal belongs to space group P1 with the following unit cell parameters:
a = 37.6 Å,
b = 64.1 Å, and
c = 72.0 Å and α = 113.1°, β = 97.8°, and γ = 92.0°. Data collection and refinement statistics are reported in
Table 1. The ZIKV MTase domain crystallized with two molecules in the asymmetric unit (
Fig. 2A), whereas size exclusion chromatography indicates that the protein behaves as a monomer in solution. The low-interaction surface area (about 500 Å
2) suggests that dimerization occurs during crystallization as previously observed for Modoc virus (MODV) MTase (
32). The two protomers (A and B chains) are similarly folded, with significant differences arising from disorder in residues 43 to 58 of chain B (
Fig. 2A). A superimposition of 244 Cα atoms in the A and B chains of the protein resulted in an RMSD of 0.52 Å. Chain A consists of a canonical MTase core (residues 61 to 229, construct numbering PDB code
5M5B ) folded into a seven-stranded β-sheet (β1 to β7) surrounded by four α-helices (αX, αA, αD, and αE), as shown in
Fig. 2A and
C. Appended to the core are an N-terminal extension (residues 8 to 60) and a C-terminal extension (residues 230 to 272). The N-terminal subdomain comprises a helix-turn-helix motif followed by a β-strand and an α-helix (A1, A2, B1, and A3). The C-terminal subdomain consists of an α-helix followed by a β-strand (A4 and B2). Inspection of the electron density maps reveals strong additional electron density in the SAM binding pocket, allowing the unambiguous modeling of SAM. Because no SAM is added during purification or crystallization, its presence in the structure must have originated from
E. coli. The SAM molecule is bound in the central cleft formed of β-strands β1, β2, and β4 and α-helices αX and αA (
Fig. 2A and
B). The SAM adenine base is accommodated within a hydrophobic pocket defined by the side chains of Val138, Phe139, and Ile153 and stabilized by hydrogen bonds with a carboxylic oxygen from Asp137 and the nitrogen main-chain atoms of Lys111 and Val138 (
Fig. 2B, left panel). The ribose moiety is hydrogen bonded to the Glu117 carboxyl group. The sugar is also bridged via a water molecule to the side chain of Glu117. The methionine tail of the SAM molecule is positioned by interactions with the side chains of Ser62, Trp93, and Asp152, as well as with the main chain of Gly92. The methionine carboxylate is also bridged via a single water molecule to the side chain of Glu117 and to the backbone of Arg90. Three molecules of sulfate interact with chain A. One of them is bound via a cluster formed by arginines Arg43, Arg47, Arg63, and Arg90 (
Fig. 2D). The structure of chain A is highly similar to those of DENV-3 MTase (RMSD of 0.79 Å with PDB code
3P97 chain A) and of ZIKV MTase recently determined by Coloma et al. (RMSD of 0.29 Å with PDB code
5KQR chain A) (
22). There is no difference in SAM binding with the latter structure (PDB code
5KQR ), and the sulfate bound via a cluster formed by arginines Arg43, Arg47, Arg63, and Arg90 overlaps perfectly with the phosphate described by Coloma et al. (
22). Structural differences from DENV-3 MTase are limited to small changes in the conformation of three solvent-exposed loops (residues 55 to 58, 178 to 183, and 251 to 254), as shown by superimposition of several structures (
Fig. 2C).
We observe significant conformational changes between chain A and chain B of ZIKV MTase (RMSD of 0.52 Å) likely favored by their different crystal environments. Indeed, the chain B environment does not allow this loop to adopt the conformation observed in chain A. However, it is unlikely that crystal packing is the only factor involved since similar disorder has been observed, under different crystal packing, for the MODV MTase (PDB code
2WA2 ) (
32). We conclude that the intrinsic flexibility of this loop, at least in the absence of the RNA substrate, has also contributed to building of this crystal form. As illustrated in
Fig. 2A and
D, three main differences are noteworthy in chain B, as follows. (i) There is no supporting electron density for residues 43 to 58 (part of the B1 β-strand, the A3 α-helix, and the following loop (
Fig. 2A]). A similar absence of electron density has already been described for one of the two chains of the MODV MTase (
32), suggesting a certain degree of flexibility in this region. (ii) Downstream of the aa 43–58 flexible region, the αX α-helix is kinked (
Fig. 2D), with its N-terminal part (residues 61 to 67) taken away from this helix axis. (iii) There is no additional electron density characterizing the methyl donor SAM in the SAM binding site. Interestingly, in chain B of the MODV MTase, a SAM molecule is present, even if the homologous region aa 35–56 appears to also be disordered. However, the kink observed in the αX α-helix of ZIKV MTase chain B is absent from MODV MTase chain B. The kink has two main structural consequences. First, the side chain of Ser62 cannot interact anymore with the methionine tail of the SAM, as observed in chain A (
Fig. 2B, left panel). Second, a cluster formed by arginines of the RNA binding groove (Arg43, Arg47, Arg63, and Arg90) in chain A is distorted in chain B and does not accommodate any sulfate molecule (
Fig. 2D). In the absence of SAM, two polar molecules, a sulfate and a glycerol close by (∼3.2 Å), are held in the SAM binding pocket of chain B (
Fig. 2A and
B). The sulfate molecule is positioned by hydrogen bonds with the side chain of His116, the main chain of Glu117, and with the bound glycerol nearby. It is also bridged via a single water molecule to the main chains of Gly112 and Glu117, as well as to the side-chain hydroxyl of Thr110. Moreover, the glycerol molecule participates in van der Waals interactions with the main chains of Gly87 and Lys111 and is stabilized by hydrogen bonds with the sulfate and the main-chain amide of Lys111. The glycerol is also bridged via two water molecules to the main chains of Asp137 and Val138 for the first one and to the main chains of Gly87 and Asp152 for the second one. Such a structure with compounds bound in the SAM binding site could serve as a basis for structure-based drug design.