INTRODUCTION
Isolated for the first time from a
Culex neavei mosquito in South Africa in 1959 (
1,
2), Usutu virus (USUV) was subsequently detected in different species of mosquitoes and birds throughout sub-Saharan countries (
1,
3). USUV is an Old World flavivirus included in the Japanese encephalitis virus (JEV) antigenic complex along with several human and animal pathogens (e.g., JEV, West Nile virus [WNV], Murray Valley encephalitis virus [MVEV], and Saint Louis encephalitis virus [SLEV]) (
4). Its genome is a single-stranded RNA molecule of positive polarity coding one long open reading frame (ORF) that is flanked by a type 1 capped 5′-terminal noncoding region (NCR) and a 3′-terminal NCR required for genome replication and translation. The polyprotein includes four 5′ structural genes (coding for capsid [C], premembrane [prM], membrane [M], and envelope [E]) and seven nonstructural (NS) genes (coding for the NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5 proteins) produced as a result of co- and posttranslationally proteolytic processing by viral and cellular proteases (
4). The virus is transmitted and maintained in the natural cycle by mosquitoes (mostly of the
Culex genus) as vectors with birds as the main amplifying hosts, while humans are considered incidental or dead-end hosts. A very recent study showed that bats could also be infected by USUV and might act as amplifying hosts (
5). In 1996, USUV emerged outside Africa and caused deaths among Eurasian blackbird (
Turdus merula) populations in the Tuscany region of Italy (
6). In the following years, the virus could be detected in several Central European countries, including Austria, Hungary, Spain, Switzerland, Belgium, Czech Republic, and Germany (
7–16). In addition, USUV-specific seroconversion among birds has been demonstrated in England, Poland, and Greece (
17–19). Within the last years, there is increasing evidence of clinically apparent human USUV infections characterized by fever, rash, jaundice, headache, nuchal rigidity, hand tremor, and hyperreflexia (
20–24). The detection of USUV-specific antibodies in blood donors in Germany and Italy highlights the fact that USUV can also be transmitted to humans without causing any symptoms (
25,
26). However, nothing is known about the evolutionary dynamics of USUV and how the virus interacts with its various host and vector species. Although few studies related to USUV genetic diversity at the country level have been conducted, there have been no studies specifically addressing the evolutionary events responsible for adaptation to the hosts and spread of USUV, making it a successful pathogen responsible for neuroinvasive disease in multiple host species, including humans. To understand the evolutionary mechanisms of USUV, we obtained the complete genome sequences of 77 USUV strains sampled from a variety of host species (principally mosquitoes and birds). With these and all available USUV sequences with known geographic and temporal information data, we conducted an expansive analysis of USUV to assess the drivers and barriers of viral migration and the pattern of evolutionary dynamics and to trace the origin of the outbreaks, as well as movement patterns of USUV strains between continents/countries and transitions between host species.
DISCUSSION
The rate of globalization accelerates the migration of exotic pathogens and their hosts to new environments facilitating contacts with vulnerable new hosts (e.g., introduction, spread, and establishment of West Nile virus in North America; introduction and autochthonous transmission of chikungunya virus and Zika virus in the Americas). Thus, understanding evolutionary processes that contribute to the emergence, maintenance, and spread of viral diseases is the sine qua non to develop and implement surveillance strategies for their control. In this study, we sought to elucidate the possible origin, pattern of spatiotemporal dynamics, and eco-epidemiological factors that shape the evolution of USUV becoming a very successful pathogen responsible for neuroinvasive disease in multiple vertebrate species, including humans.
Our phylogenies showed an important spatial differentiation between Central European and African USUV that resulted in phylogeographic clustering of 6 distinct lineages. We found evidence that the phylogenetic structure is shaped by the geographic location and pattern of migratory flyways, which likely facilitates rapid long-distance virus dispersal. This demonstrates that the major migratory bird flyways (East Atlantic and Black Sea/Mediterranean [
Fig. 1; see
Table S1 in the supplemental material]) could predict the continental and intercontinental dispersal patterns of USUV in our data sets (long-distance linkages between African countries, and Africa-Europe supported by high BF values) and that the birds might act as potential long-distance dispersal vehicles. These results are comparable with the predicted dispersal pattern of West Nile virus across the United States via avian flyways (
30–32). A recent study revealed that the migratory status did not appear to influence WNV viremia titers (birds remain viremic for several days), as might be expected if individuals were immunosuppressed during migration. Furthermore, the infection does not inhibit migratory behavior, demonstrating that long-distance migratory birds are able to carry the virus over long distances (
33,
34). Likewise, there are many evidences of USUV-seroconverted migratory bird species detected throughout Europe, which use East Atlantic and/or Black Sea/Mediterranean flyways (
18,
35–37). However, the long-distance spread of the virus through ship- or aircraft-borne transportation of USUV-infected mosquitoes cannot be excluded.
The phylogeography results show a westward spread of USUV in Europe that matched the chronological and geographical incidence of USUV epizootics in Central Europe. The BSSVS model confirmed the multiple introduction of the virus into Europe from Africa, with Senegal as a possible origin for the progenitor of Central European epizootics (
Fig. 1 and
2). The inferred spread of USUV indicates that in Africa, Senegal was probably the major source population, whereas in Central Europe, Austria represented the primary source and Italy the source of USUV diffusion (
Fig. 1 and
3). The limited number of sequences from Africa reduces our ability to predict viral migrations within African countries. Given that many intervening countries are unsampled and long branches may obscure additional spatial movements of the USUV between Africa and Europe, all inferences of spatial connections between African countries should be interpreted with caution.
The existence of geographically distinct lineages in Europe (likely due to adaptation to the local ecological conditions and overwintering in enzootic foci) reflects the fact that USUV circulates in multiple areas that are separated from each other by geographic barriers such as climate, vegetation, different host species, and other unknown ecological conditions. Thus, the adaptation of USUV to naïve vector and host populations can lead to the emergence of local virus variants. The most likely scenario for European lineages might be enzootic maintenance (
in situ evolution) similar to that observed for West Nile virus and St. Louis encephalitis virus in United States (
30,
31,
38–41). This hypothesis is supported by the observation that European lineages form a star-like structure (population expansion after a single viral introduction) in which the variant viral strains accumulate changes during the rapid adaptation to the local ecological conditions (e.g., USUV epizootics in Germany and Austria). Furthermore, the phylogeny-trait association tests indicated a very strong geographical clustering of European lineages (
P < 0.001), supporting the
in situ evolution scenario. It has also been observed that the African lineages are driven mostly by extensive migration and introduction of viral variants from different geographic origins (e.g., emergence of 2 lineages in Spain).
The estimated substitution rate of 1.13 × 10
−3 substitutions site
−1 year
−1 is very similar to the rate 1.37 × 10
−3 given by Nikolay et al. (
42) for their analysis of the E gene using a much smaller data set. However, the mutation rates observed for E, NS5, and the complete genome are within the confidence interval estimated for other flaviviruses (
29), and the differences per gene are expected given their particular biological role.
Although USUV has received more attention only recently after the first epizootics and human cases in Europe, our estimates suggest that the virus emerged in Africa at least 500 years ago. This estimation is compatible with those observed for other members of the Japanese encephalitis virus group (500 to 1,500 years) (
39,
43,
44). It should be pointed out that the accuracy of the TMRCA of USUV could be influenced by several factors, including the limited amount of complete sequence data, the short and unbalanced time span, and the presence of many intervening unsampled African countries that can lead to underestimation of the lengths of long branches. However, the estimated TMRCAs of European lineages (~40 years) were similar in E, NS5, and complete genome MCC trees (
Fig. 1; see
Fig. S3 in the supplemental material). The apparent absence of virus activity for such a long period can be explained by several hypotheses: introduction of less virulent viral strains; cross-reactive flavivirus antibodies (heterotypic flavivirus antibodies as a result of frequent exposure to numerous flaviviruses [45]) among birds and other vertebrate hosts, which could have modulated or downregulated clinical illness and viremia, reducing the transmission (
45); the absence of long periods of hot and dry weather, which influence the abundance of competent vectors and transmission to susceptible hosts (
46); reduced population of susceptible hosts; herd immunity of resident bird populations supporting a silent spread of the virus (
47); and other extrinsic factors that could have influenced selection of less virulent virus strains.
A very recent study showed that USUV is able to emerge in other hosts (bats) (
5). This observation raised questions regarding the USUV host range and the ability to adapt to the new hosts. Due to limitations of the available sequences, mostly from African countries, the host species analysis should be interpreted with caution. While the coloring of branches in
Fig. 3 indicates the mosquito as the most probable host for the common ancestor of all USUV strains, this is unlikely to be reliable, due to the unbalanced sampling of different hosts. Nevertheless, that USUV originated in the mosquito is comparable with the fact that the mosquito is the main host for USUV. Transition from mosquitoes to birds and vice versa has been reconstructed (95% pp), being consistent with the enzootic cycle of the USUV. Although humans are considered incidental or dead-end hosts for USUV, adequate molecular surveillance is essential for public health (diagnostic and blood/organ donors) due to the increasing number of human USUV infections. Several genetic signatures were found in both African and European strains, which could be useful for development of molecular methods (e.g., real-time PCR) capable of differentiating USUV lineages. The virulence of the specific USUV lineages is currently unknown, and further studies are necessary to determine the biological characteristics of each lineage.
The overall low
dN/
dS ratio in the polyprotein gene indicates that most amino acid residues are subjected to purifying selection as results of genetic drift characteristic for arboviruses (
48). We found strong evidence of adaptive evolution only at codon 898 in the NS5 gene. Although the function of this residue remains to be determined, it is located adjacent to codon D896E observed in the human USUV strain, which was associated with viral replication efficiency and neuroinvasive capacity in certain strains of JEV and WNV (
25). In addition, a single amino-acid change in NS5 has been found to influence WNV replication in different hosts (
49). To determine whether NS5-898 may be involved in the replication of USUV in different hosts, further studies are required. We also observed that a larger number of sites in the USUV polyprotein might be subjected to positive pressure evolving under episodic directional selection, indicating past occurrence of positive selection. The purifying selection observed is expected given the transmission and infection modes of arboviruses allowing accumulation of synonymous mutations and negatively selected sites as the effect of alternation between the arthropod vector and avian or mammal hosts (
48). It is interesting to note that the pattern of increased positive selection in nonstructural genes compared to that in structural genes is similar to those observed in WNV, indicating that the host immune selection pressure does not caused increases in viral fitness (
31). Mutations observed at amino acid positions V91A (NS2A), L46F and T359K (NS3), and D67Y (E) have been found to be involved in the formation of Europe 1 to 3, while A120V (C), G320S and P376S (E), and A274T (NS5) are specific for African lineages (convergent evolution). Similar patterns of parallel or convergent evolution have been observed for WNV. This suggests that a limited number of residue changes are permitted due to functional constraints (
41). It is interesting to note that the V91A (NS2A) and L46F (NS3) mutations are specific for German strains and likely occurred due to introduction of USUV in this country. Although the impact of these mutations is unclear, similar changes in the related WNV modulated the host antiviral response by inhibition of interferon signaling (
50). The residue exchange E-G302S observed in human USUV cases is considered important because it might have played a role in the human-specific neuroinvasive capacity of this virus (
22). Our homology model suggested that each envelope protein domain is an important site for USUV evolution. In particular, specific amino acid substitutions within the DIII domain have been observed only in African lineages and in human cases from Italy (
Fig. 5). It is known that DIII of flaviviruses represents a receptor binding domain and a major determinant of virus cellular tropism (
51). Such mutations in WNV DIII have been involved in virus infectivity, virulence, antigenicity, and escape from neutralizing antibodies (
52). It is unknown if these mutations in DIII observed are aftereffects of the mechanisms of antigenic escape or further adaptation to the host, but they might confer an evolutionary advantage to the virus. Potential cleavage sites for generation of USUV proteins and cysteines were conserved in all USUV strains, suggesting that their biological roles are preserved (see
Table S2 and
S3 in the supplemental material).
In all USUV strains, the 5′ secondary structures were conserved, whereas highly variable size heterogeneity in the 3′ NCR was detected. Five distinct 3′ NCR patterns were detected (see
Fig. S6 in the supplemental material): three of them revealed long deletions in stem loops SL-I and SL-V, representing the hypervariable region of 3′ NCR. It has been shown that the variation in this region may have evolved as a function of dengue virus (DENV) transmission and replication in different mosquito and non-human primate/human host cycles (
53).
Viral adaptation in mosquitoes and vertebrate hosts by local overwintering or reintroduction of the virus and migratory bird flyways could be considered key determinants in the spatial dispersal and establishment of USUV. Thus, further studies preferentially based on complete genomes, including those from previously unsampled intervening countries, are clearly necessary to fully understand the impact of ecological/immunological/virological factors on USUV epidemiology and evolution in different ecological habitats. This should be more feasible in the era of next-generation sequencing.