INTRODUCTION
Both arms of the adaptive immune system help control influenza virus replication: antibodies neutralize virus (
1) and direct the clearance of infected cells (
2), while CD8
+ T cells kill infected cells that display viral peptides on their major histocompatibility complex (MHC) class I molecules (
3,
4). While antibodies against the viral surface protein hemagglutinin (HA) provide the most potent protection when they are well matched to the virus strain (
5–7), T cells offer broader protection against diverse strains since they tend to recognize epitopes in more conserved internal viral proteins such as nucleoprotein (NP) and matrix protein (M1) (
3,
4,
8,
9).
Studies in both mice (
10–14) and humans (
9,
15,
16) have shown that preexisting influenza virus-specific CD8
+ T cells reduce the severity of disease and enhance virus clearance. For instance, preexisting virus-specific CD8
+ T cells were correlated with decreased symptoms in humans infected during the 2009 H1N1 pandemic (
15). Similarly, T cells specific for NP were associated with a decreased incidence of symptomatic infection over a multiyear study of a large human cohort (
9), and CD8 T-cell responses were correlated with recovery from severe H7N9 infection (
16). Therefore, experimental and epidemiological work demonstrates that CD8
+ T cells contribute to immunity against influenza.
Because humans are repeatedly infected with influenza over their lifetimes, one might expect viruses to be under evolutionary pressure to accumulate substitutions in epitopes targeted by immune memory. Indeed, there are numerous examples of the fixation of antibody escape mutations in HA (
17,
18), consistent with the notion that this protein evolves under strong selection from antibodies. Several studies have also described influenza virus mutations that escape recognition by CD8
+ T cells (
19). In a mouse study, viral mutations arose that conferred T-cell escape in RAG-1-deficient mice expressing an influenza virus NP-specific T-cell receptor (TCR) (
20). Rimmelzwaan and coworkers identified the fixation of mutations in NP of human H3N2 virus that mediated escape from CD8
+ T cells by altering the epitope recognized by the T-cell receptor (
21–23) or by abrogating binding of the epitope to MHC class I molecules (
24). Valkenburg et al. described the emergence of CD8
+ T-cell escape mutations in a persistently influenza-infected infant (
25). These elegant studies demonstrate that influenza virus accumulates substitutions that escape CD8
+ T cells as well as antibody-mediated immunity.
However, these studies do not prove that positive selection for CD8
+ T-cell escape is an important driving force in the evolution of influenza virus, since many sites in the virus genome will fix substitutions given enough time (
26–28). To rigorously establish the presence of positive selection, the field of molecular evolution has developed statistical tests to discern whether a subset of sites is evolving faster than expected. Most of these tests compute nonsynonymous and synonymous distances (referred to as
dN and
dS, respectively) and then test for sites with statistical evidence that the accumulation of nonsynonymous substitutions exceeds that of synonymous substitutions (
dN/
dS ratio of >1) (
29,
30). These tests consistently find overwhelming evidence for positive selection in the antigenic sites of influenza virus hemagglutinin (
31–33) but little evidence for positive selection in CD8
+ T-cell epitopes (
33). One study reported that CD8
+ T-cell epitopes in NP have a higher
dN/
dS ratio than do other sites (
34); however, that study made a pairwise comparison of two sequences only and included no tests for statistical significance. Below, we describe the use of several state-of-the-art tests to verify that CD8
+ T-cell epitopes have neither an elevated frequency of sites with a
dN/
dS ratio of >1 nor an elevated rate of nonsynonymous substitutions. Therefore, by standard criteria, CD8
+ T-cell epitopes are not under positive selection.
The results of these statistical tests for positive selection seem at odds with the extensive body of experimental work described above. We hypothesized that the discrepancy arises because known CD8
+ T-cell epitopes are under strong functional constraint (
34–37). If epitopes are highly constrained, then even strong positive selection might fail to elevate the rate of nonsynonymous substitutions in epitopes above that at less constrained nonepitope sites. To address this possibility, we developed new statistical tests that take advantage of the fact that some lineages of human influenza virus are paralleled by lineages of swine influenza virus that are not under selection from human CD8
+ T cells. Using these tests, we show that CD8
+ T-cell epitopes in NP evolve significantly faster in human influenza virus than in swine influenza virus. Furthermore, we show that substitutions in these epitopes are enriched on the trunk of the phylogenetic tree, indicating that viruses that acquire them have a selective advantage that promotes their evolutionary spread. Overall, our work provides clear statistical evidence that complements prior experimental studies showing that CD8
+ T-cell epitopes are under selection in human influenza virus (
22) and suggests that the failure of conventional tests to identify this selection is due to high levels of functional constraint in epitopes.
DISCUSSION
We describe the first rigorous statistical evidence that CD8
+ T-cell epitopes are under positive selection in human influenza virus. Our work adds to a growing body of evidence suggesting an important role for T-cell immunity in shaping influenza virus evolution. Previous studies showed that T cells help protect against human influenza virus (
9,
15) and detailed specific instances of T-cell escape (
21–25). Our work shows that T-cell selection increases the rate at which mutations are fixed in epitopes of NP and indicates that viruses with these substitutions have a selective advantage that makes them more likely to fall along the trunk of the phylogenetic tree.
Our results also explain why conventional
dN/dS tests fail to detect positive selection in CD8
+ T-cell epitopes. Known human CD8
+ T-cell epitopes tend to be under strong functional constraint (
34–37). It is unclear whether this is because T cells inherently target conserved epitopes, because repeated infections preferentially boost T cells that target conserved epitopes, or because there is a bias toward experimentally identifying conserved epitopes. However, in any case, the fact that known epitopes are under strong constraint means that even fairly strong positive selection may not enhance the nonsynonymous substitution rate to a level detectable by conventional
dN/dS tests. This contrasts with antibody epitopes in HA, where the ability of
dN/dS tests to detect antibody-mediated positive selection is probably augmented by the fact that antigenic sites are disproportionately tolerant of point mutations (
68).
The novel approach that we developed ameliorates this problem by comparing the evolution of epitopes of human and swine influenza viruses or the entire phylogenetic tree and only its trunk. These comparisons should better control for site-to-site variation in functional constraint, since comparisons are always made between homologous sites that should be subject to similar functional constraints. Admittedly, there may also be other differences in functional constraints between human and swine influenza viruses beyond T cells, but unless these differential constraints are systematically biased toward occurring at T-cell epitopes, they should not alter the fundamental validity of our approach. By making comparisons in this way, we demonstrated clear positive selection in CD8+ T-cell epitopes in NP, both in human versus swine influenza viruses and in the trunk versus the entire phylogenetic tree.
One interesting aspect of our study is that we found positive selection in NP but not M1. This finding is consistent with a recent large-scale study that found that NP was the only protein for which the presence of preexisting memory T cells correlated with decreased rates of symptomatic infections (
9). However, our study does not preclude an important role of T cells targeting M1, which contains an immunodominant HLA-A2 epitope spanning residues 58 to 66 (
69,
70). One study argued that T cells targeting this epitope are ineffectual (
70), although this interpretation is disputed (
71,
72). However, experiments have also shown that this epitope is under strong constraint (
34). If an epitope is completely intolerant of mutations, it will of course be unable to accumulate substitutions regardless of the strength of selection. It remains unclear if our failure to detect positive selection in M1 reflects a lack of effective immunity targeting this protein or strong constraints that simply prevent the fixation of escape mutations.
It is well established that antibodies are strong drivers of repeated selective sweeps in the evolution of human influenza virus (
66,
73). The fact that we can detect positive selection by CD8
+ T cells even in the presence of these antibody-driven selective sweeps demonstrates the importance of T-cell immunity in driving viral evolution. The existence of such selection is consistent with modeling studies showing that T-cell immunity that reduces the infectious period can strongly favor viral escape (
74).
There is considerable interest in developing vaccines that elicit stronger T-cell immunity to better protect against diverse influenza virus strains (
3). Our demonstration of the evolutionary importance of T-cell selection suggests that this interest is well founded. In addition, our results suggest that attempts to forecast the seasonal evolution of influenza (
75,
76) could benefit from examining changes in T-cell as well as antibody epitopes.