INTRODUCTION
Human papillomaviruses (HPVs) comprise over 200 types. In accordance with their sequence alignment of the major capsid protein L1, they are divided into the 5 genera alpha, beta, gamma, mu, and nu. HPVs infect skin squamous epithelial cells (alpha and beta genus) and mucosal epithelial cells (alpha genus) in humans. An infection can either be asymptomatic or result in benign tumors or cancer. Cancer development occurs only in the rare cases of persistent infection and failure of viral clearance. However, so-called “high-risk” HPVs are classified as most carcinogenic by the International Agency for Research on Cancer (IARC) (
1,
2) because they are highly associated with cancer of the cervix (100.0%), oropharynx (30.8%), vulva (24.9%), vagina (78.0%), penis (50.0%), and anus (88.0%) (
3). The carcinogenic potential varies between the different HPVs. All high-risk types belong to the alpha genus but to the following four different species: alpha-5 (HPV51), alpha-6 (HPV56), alpha-7 (HPV18, HPV39, HPV45, HPV59, and HPV68), and alpha-9 (HPV16, HPV31, HPV33, HPV35, HPV52, and HPV58). HPV16 is associated with ∼50% of cervical cancer, and HPV18 is associated with 20%, whereas HPV31, HPV33, HPV35, HPV39, HPV45, HPV51, HPV52, HPV56, HPV58, HPV59, and HPV68 together are associated with ∼30% (
4–6).
Two viral oncoproteins, E6 and E7, which are always expressed in HPV-associated cancers (
7), are responsible for the immortalization of human keratinocytes, the target cells of HPV (
8). Current models suggest that protein-protein interactions of both viral oncoproteins with their cellular targets contribute to carcinogenicity. Most E7 proteins share the capacity to bind to retinoblastoma (Rb) family members as well as phosphatases PTPN14 and PTPN21 (
9–13) independently of the carcinogenic risk level of HPVs they belong to. In contrast, E6 proteins do not share any universal target cell protein conserved for all HPVs. However, binding to the E3 ubiquitin ligase E6-associated protein (E6AP), the tumor suppressor protein p53, and PDZ domain proteins are key interactions that are specifically displayed only for E6 proteins of high-risk alpha mucosal HPVs. The simultaneous recruitment of E6AP and p53 by E6 results in the degradation of p53 (
14,
15).
E6 is highly conserved among papillomaviruses and consists of two zinc-binding domains (
Fig. 1). An E6-based phylogenic classification depicts similar relationships for E6 proteins within the alpha genus as the common L1-based classification. Within the alpha-9 group, HPV16, HPV35, and HPV31 are the closest-related HPVs. Generally, E6 proteins bind to accessible LxxLL motifs of various cellular target proteins. The LxxLL-binding profile of different E6 proteins varies within the alpha genus (
16).
HPV16 is the HPV with the highest carcinogenic risk. It has been shown previously that 16 E6 forms a ternary complex with the E6AP-LxxLL motif and the p53 core domain (
17). Once formed, the E6/E6AP/p53 complex mediates ubiquitination and subsequent proteasomal degradation of p53 (
18). Among all cellular interaction partners of E6, this ternary complex is the best-characterized interaction in structure and function for 16 E6. Studies on binding parameters of the ternary complex depicted binding affinities of the 16 E6-LxxLL(E6AP) dimer to p53 in a micromolar range (
17) and defined crucial amino acids for complex formation and subsequent p53 degradation (
Tables 1 and
2).
Figure 1 shows a sequence alignment of E6 proteins of the alpha-9 genus and HPV18 (alpha-7) as the second most prevalent high-risk HPV. Amino acids of 16 E6, which significantly contribute to LxxLL(E6AP) or p53 binding are not completely conserved among the alpha-9 HPVs. In
Tables 1 and
2, these amino acids are compared regarding their conservation between 16 E6 and 31 E6. Despite the high sequence homology of 16 E6 and 31 E6 of 67% identity (ClustalO [
19]), both binding sites show sequence variations which can potentially interfere with target binding, the formation of the ternary complex, and subsequent p53 degradation. Additionally, quantitative assays using a luciferase-p53 fusion protein as a substrate for E6 proteins revealed that HPV16 E6 is more active in initiating p53 degradation than HPV31 E6 (
20). These data suggest that besides qualitative, also quantitative differences among E6 and E7 protein interactions may explain the different carcinogenic behavior.
HPV16 and HPV31 are closely related, belong to the same genus and species, and show consistence in phylogeny and pathology. However, it is not completely understood why HPV16 is far more carcinogenic than other high-risk alpha-9 HPVs. In this work, the crystal structure of 31 E6 was solved by X-ray crystallography. The binding properties of 31 E6 to p53 and LxxLL(E6AP) were analyzed quantitatively in comparison to 16 E6. Both HPVs represent high-risk types of alpha-9 genus infecting the mucosal keratinocytes but reveal a different p53 degradation potential in cell-based assays (
2,
6,
20). Presumably, the degradation of p53 is greatly related to carcinogenicity, and variations in p53 degradation could be one factor in this process. Hence, our structural and quantitative analysis bridges sequence, structure, and function together and, further, suggests explanations regarding the different p53 degradation potential of two very closely related HPV E6 proteins, both targeting the same cellular targets, E6AP and p53.
DISCUSSION
All high-risk HPVs inactivate the tumor suppressor protein p53 via E6AP-dependent proteasomal degradation, which promotes cell immortalization. However, they differ in their ability to degrade p53, which may affect carcinogenic potential. E6 recruits E6AP and p53 to form the ternary complex E6/E6AP/p53, which is required for p53 degradation. The inactivation of p53 via proteasomal degradation is based on the formation of the ternary complex E6/E6AP/p53 (
17). Differences in the assembly of this complex can alter p53 degradation efficiency. The scope of this work was to characterize the ternary complex of two very closely related alpha-9 high-risk HPV types, HPV16 and HPV31, in order to investigate whether (i) phylogenetic similarity results in structural conservation, and (ii) the binding of E6 proteins to the same cellular targets differs structurally and quantitatively between 16 E6 and 31 E6.
As expected, the overall structure of 31 E6 resembles that of 16 E6, with two zinc-binding domains E6N and E6C forming a binding cleft for LxxLL motifs (
Fig. 3). Due to the high sequence conservation, especially of the zinc-binding motifs of the HPV E6 proteins, this can very likely be claimed also for other HPV E6 proteins. The sequence alignment of HPV alpha-9 E6 (
Fig. 1 and
Table 1) shows that the LxxLL motif-binding site of E6 is highly conserved. Amino acid 16 E6 L50 in the hydrophobic LxxLL motif-binding pocket, which abolishes LxxLL(E6AP) binding if mutated (
27,
28), is conserved between 16 E6 and 31 E6 and within all alpha species. Additionally, mutations of 16 E6 R102 and R131 to alanine largely impair E6AP interaction. These amino acids are also conserved in alpha-9 HPVs and contribute to LxxLL(E6AP) binding by polar interactions. However, we found that the affinity of 31 E6 to the LxxLL(E6AP) peptide is 2-fold lower than that of 16 E6 to the same peptide. Structural comparison of 16 E6/LxxLL(E6AP) and 31 E6-LxxLL(E6AP) showed a slightly shifted E6C domain. Indeed, the two different conformations of the E6C domain can be related to the heterogeneous dynamic behavior of the E6C domain, which was previously reported in nuclear magnetic resonance (NMR) solution studies performed on various E6C domains (
29–31). The E6C domain is one building block of the LxxLL(E6AP)-binding cleft. Subsequently, flexibility of the E6C domain can be one reason for different binding affinities to the LxxLL(E6AP) peptide. Moreover, sequence differences between 16 E6 and 31 E6, as described in
Table 1, participating in LxxLL(E6AP) binding, result in less protein contacts in 31 E6. HPV16 E6, if mutated to the 31 E6-analogous amino acids (C51T, H78W, and R129G), resulted in a tremendously reduced binding affinity to the LxxLL(E6AP) peptide. These amino acids are not conserved in the alpha-9 genus at all (
Fig. 1). As a conclusion, minor amino acid variations are another possibility of the lower affinity of 31 E6 to LxxLL(E6AP).
Notably, 16 E6 mutants showing impaired binding to LxxLL(E6AP) also showed less efficient p53 degradation (
22,
28). As neither E6 nor E6AP alone are able to interact directly with p53 (
32–35), the binding to p53 requires the formation of the E6/E6AP complex. The binding of E6 to the LxxLL(E6AP) peptide is sufficient to recruit the core domain of p53 (
17). However, it was recently reported that additional binding sites at the N-terminal region of E6AP are necessary to stimulate the ubiquitin-ligase activity of E6AP by 16 E6 (
28). Here, we focused on one interaction site of E6 and E6AP, the LxxLL(E6AP) motif, necessary for p53 binding but not sufficient for p53 degradation. The structure and binding affinities of the functional complex in terms of p53 ubiquitination are still elusive.
The binding of E6AP to E6 is required prior to binding of p53 to E6. In order to investigate the binding of p53 to E6, we mimicked a “p53-ready” E6 by fusing the LxxLL(E6AP) peptide to the C terminus of E6 (
Fig. 2). In this proximity, the LxxLL(E6AP) peptide is bound to E6, and therefore, the measured binding of the p53 core domain is presumably independent of the required binding of LxxLL(E6AP). Of course, in the cellular environment, the sequential binding of E6AP and p53 to E6 finally determines p53 degradation.
Some amino acids (D44, F47, and D49), crucial for p53 core domain interaction in the 16 E6/LxxLL/p53 complex, are conserved within 16 E6 and 31 E6, suggesting that 31 E6 can bind to p53 (
Table 2).
However, the 31 E6 binding site shows striking amino acid differences compared to 16 E6. Of these amino acid variations, it was shown that 16 E6 mutants Q6A and Q14A (not conserved in alpha HPV) bind to E6AP and degrade p53 similarly to wild-type 16 E6
in cellulo (
17), indicating that variations at these positions have a minor influence on p53 binding and degradation
in cellulo. In contrast, the 16 E6 E18A mutant showed 75% lower binding to p53 and a decrease in p53 degradation efficiency (
17). Strikingly, in 31 E6, this position is an alanine residue (A18). Indeed, the mutation of A18 to E18 in 31 E6 resulted in an increased affinity to p53 core domain. This position may have an influence on p53 binding affinity and subsequent degradation, and accordingly, it is subject to variation across alpha species, where it is conserved neither in high-risk nor in low-risk HPVs. The gain of affinity of the 31 E6A18E mutant is rather low, indicating that additional variations between 16 E6 and 31 E6 influence the binding to the p53 core domain, like the observed shift in the E6C domain or other sequence variation. For example, our structural analysis indicates that the sequence variation 16 E6 Y43 and 31 E6 L43 is compensated by 16 E6 I23 and 31 E6 Y23, retaining the hydrophobic pocket for p53 binding. However, slight variations in the p53-binding pocket can also lead to different affinities. Overall, the sequence differences and structural analysis parallel the obtained 5.4-fold lower
Kd for 31 E6-LxxLL(E6AP) binding to the p53 core domain.
The formation of the ternary complex is presumably stronger for 16 E6 because it shows higher affinities to both LxxLL(E6AP) and the p53 core domain. Our affinity analysis strongly agrees with the previously reported >2-fold less efficient degradation of p53 in HPV31 E6-transfected cells compared to 16 E6 transfected cells, even though 31 E6 shows an almost 3-fold higher cellular level than 16 E6 in these experiments (
20).
HPV18 is the second most prevalent HPV associated with cervical cancer and belongs to the alpha-7 HPV species. The intracellular level of 18 E6 resembles 16 E6, but it shows almost 2-fold less efficient p53 degradation (
20). Overall, 18 E6 shares less sequence identity (∼57%) with 16 E6 than to 31 E6 with 16 E6 (∼66%). Slight structural differences, e.g., the position of the E6C domain, are not predictable but can change the binding to the p53 core domain and E6AP. 18 E6 does possess all crucial amino acids necessary for p53 core domain binding; only 16 D44 is found as the homologous amino acid E in 18 E6. On the other hand, LxxLL(E6AP)-binding 16 R131, which shows ∼50% reduced binding to LxxLL(E6AP) if mutated to A, is not conserved in 18 E6 (18 E6 H131; see
Fig. 1). These variations can potentially influence the efficiency of ternary complex formation and subsequent p53 degradation. Low-risk HPVs already show much lower sequence identity to 16 E6 (e.g., 11 E6, 36%) and already possess amino acid differences which neglect binding to p53 (e.g.11 E6 has no conserved 16 E6 E18, D44, F47, or D49, crucial for p53 core domain binding). Consequently, low-risk HPVs are inactive in E6AP-dependent p53 degradation altogether (
20).
It is important to note that, apart from defined crucial amino acids in 16 E6 (
Tables 1 and
2), the individual subset of minor sequence differences and the flexibility of the E6C domain position influence E6 structure and the binding to E6AP and p53. These sequence variations increase with decreasing phylogenetic relations of E6 proteins. Their impact on the E6 structure and binding affinities is not predictable. As a conclusion, binding affinities certainly vary between the E6 proteins but must be analyzed individually.
Further, it must be pointed out that alpha-9 high-risk HPV52 and HPV58 E6 proteins show higher p53 degradation efficiencies
in cellulo than 16 E6 despite similar intracellular protein levels of E6 proteins (
20), but they possess a lower carcinogenic potential. Here, p53 degradation potential does not correlate with the cancerogenic risk (
20,
36). The physiological context likely represents a more complex situation. Apart from p53 degradation, many other factors contribute to viral persistence and HPV-associated cancer, which further differ among different HPV genera, species, and types. Carcinogenicity of HPV may be influenced by many parameters, including the entire viral interactome of E1 to E7, transcription regulation, half-life of proteins, and deregulation of posttranslational modifications (
37–39), all playing a role in DNA damage response (reviewed in references
40–42), persistence, and immune response (
43; reviewed in references
44 and
45); E6-mediated degradation of other cell proliferation regulatory proteins, e.g., NHERF1 (
27); and still elusive factors. Notably, human keratinocytes can be immortalized by 16 E7 alone (
46). Coexpression with 16 E6 increases the immortalization rate (
47), highlighting the important concomitant role of E7 in HPV-associated carcinogenesis. Nonetheless, the inactivation of p53 remains a very important process with respect to cell immortalization. A 16 E6 mutant deficient in p53 interaction showed tremendously decreased potency of cell immortalization through being coexpressed with 16 E7 (
22).
Further, the multiple interactions of E6 play important roles in cell immortalization (
48). E6 proteins can bind to various LxxLL motifs of other cellular targets, e.g., human telomerase reverse transcriptase (hTERT) (
49) and interferon regulatory factor 3 (IRF3) (
50,
51). Additionally, E6 PDZ-binding motifs (PBM) differ even within HPV species (reviewed in reference
52). The last 4 amino acids of the PBM of 31 E6 (ETQV) differ only slightly from 16 E6 (ETQL). However, it was shown previously that HPV18 E6 (ETQV) has a different PDZ-binding profile than16 E6 (
53). Assumingly, the PDZ-binding profile of 31 E6 is also different from 16 E6. It was shown that an interaction of E6 with the PDZ-containing protein MAGI-1 results in degradation of MAGI-1 for carcinogenic as well as noncarcinogenic types (
36,
54) of alpha papillomaviruses in a very similar efficiency. The authors draw the conclusion that MAGl1 degradation alone cannot result in carcinogenesis but can in concert with p53 degradation and hTERT (
55). In conclusion, different PDZ-binding partners can also influence the carcinogenic potential of E6.
The interactome of both oncoproteins E6 and E7 facilitates cell transformation. The p53 degradation potential is one important factor in carcinogenesis, especially for high-risk types. Semiquantitative analysis revealed a link between carcinogenicity and p53 degradation (
20) for some HPVs such as HPV16 and HPV31. Further, HPV16 is associated with ∼50% of cervical cancers; in contrast, HPV31 is only associated with 3 to 8% (
56,
57). This difference is even more significant in HPV-positive tumors of the oropharynx, where HPV16 accounts for 93%, and HPV31, together with 12 other HPVs, accounts for 4% of these cancers (
58). In this context, it is interesting that 16 E6 and 31 E6 share the same structural fold, but 31 E6 forms the ternary complex E6/E6AP/p53 less efficiently. Consequently, the E6-mediated proteasomal degradation of p53 can be impaired. In principle, these findings are likely conferrable to the alpha-9 species and beyond and are not limited to the proteins of the ternary complex analyzed here. In summary, in addition to the diverse interactions of E6 with different interaction partners (qualitative differences, such as 16 E6 binds to E6AP and 8 E6 binds to MAML1 [
16,
59,
60]), divergence of E6 proteins could also be explained by different affinities of very closely related E6 proteins to the same cellular targets [quantitative differences, such as 16 E6 and 31 E6 bind to LxxLL(E6AP) and p53 with different affinities].
ACKNOWLEDGMENTS
We gratefully thank Joerg Martin and Lorena Voehringer (MPI Tuebingen) for their kind support with MST measurements. X-ray data collection was performed on the PXIII beamline at the Swiss Light Source synchrotron, P. Scherrer Institute, Villigen, Switzerland. We thank V. Olieric and C.-Y. Huang for their help on the beamline.
Further, this work received institutional support from le Centre National de la Recherche Scientifique (CNRS), Université de Strasbourg, Institut National de la Santé et de la Recherche Médicale (INSERM), and Région Alsace. The work was supported in part by grants from Ligue contre le Cancer (équipe labelisée 2015 and fellowship to A.B.), Ligue contre le Cancer CCIR-GE, ANR (Infect-ERA program, project HPV motiva), Fondation recherche Médicale (fellowship to A.B.), National Institutes of Health (grant R01CA134737), Instruct (ESFRI), and the French Infrastructure for Integrated Structural Biology (FRISBI, ANR-10-INBS-05) and Instruct-ERIC. We declare that the content is solely our responsibility and does not represent the official views of the National Institutes of Health.
G.G. was supported by the Post-doctorants en France program of the Fondation ARC Pour La Recherche Sur Le Cancer.
M.C.C., G.G., A.B., A.C.-S., J.L., F.S., T.I., G.T., and C.S. did the experimental design and data analysis/interpretation; M.C. and G.G. carried out the fluorescence polarization; I.S., M.C., A.C.-S., A.M., and G.G. did the crystallography; C.S. performed the MST-DIF; and M.M. did the GPCA-MC.