INTRODUCTION
Leishmaniasis is a neglected tropical disease caused by protozoan parasites of the genus
Leishmania. Visceral leishmaniasis (VL), the most severe form, is caused by
Leishmania infantum in South America, the Mediterranean basin, the Middle East, and Central Asia and by
Leishmania donovani in East Africa and the Indian subcontinent. VL is characterized by fever, weight loss, hepatosplenomegaly, anemia, pancytopenia, hypoalbuminemia, and hypergammaglobulinemia and is frequently fatal if untreated (
1,
2). Severe disease presents as hepatic dysfunction with jaundice, edema, and dyspnea (
3). Death is usually associated with hemorrhagic phenomena, which could be caused by bacterial coinfection (
4,
5) as a consequence of cytokine-driven systemic inflammation (
6).
Leishmania/HIV results in greater risks of relapse and fatality (
7,
8). Despite the range of clinical presentations, the parasite factors that determine differential virulence in humans are not well understood.
There are approximately 5,000 VL cases/year in Brazil, 90% of the cases registered in the Americas (
9–12). There are large numbers of asymptomatic
L. infantum infections, with various studies identifying 1 to 6% of the general population in regions of endemicity (
7,
13,
14). VL case fatality rates in Brazil average 7%, the highest in the world, and show an increasing trend (
12). Hence, understanding the factors that lead to severe disease and mortality is a priority for leishmaniasis research. Disease outcomes are known to be associated with patient age, sex, comorbidities (
4), adverse effects of treatment, and coinfections such as HIV (
15). Genome-wide association studies (GWAS) have shown to human genotypes influence VL disease severity in East Africa, Brazil, and the Indian subcontinent (
16), including a strong influence from the HLA-DR-DQ region within the major histocompatibility complex (
17). However, apart from studies of drug resistance (
10,
18), the influence of natural parasite genetic variation on disease severity has not been investigated using genome-scale methods.
To investigate this, we collected 109
L. infantum isolates from the states of Piauí and Maranhão in Brazil, which have large VL burdens (
19,
20). We used genome-scale quantitative genetic analyses to investigate the effects of parasite genotype on multiple clinical indicators of disease severity and mortality. Our analysis showed that parasite genotype influences multiple aspects of disease severity and has a strong relationship with patient mortality. We identified multiple parasite genetic loci that affect mortality. These virulence factors are key points for vaccine and drug discovery (
21) that will be complementary to reverse vaccinology methods (
22).
DISCUSSION
We used genome data from 109
L. infantum isolates from northeastern Brazil coupled with detailed patient clinical data to study the effects of parasite variation on VL disease. We found that the chance of mortality and patient sex were strongly associated with parasite genotype (heritability, 0.83 ± 0.17 and 0.60 ± 0.27, respectively). Although our sample size is very small, we found multiple associations that are supported by permutation-derived thresholds for several clinical traits that are not correlated with one another. Both these significant GWAS associations (
Fig. 3) and the inflation of
P values in the SNP QQ plots indicate that multiple genetic factors contribute to different aspects of clinical outcomes. While this sample size is very small, similarly small sample sizes have identified significant association with microbial GWAS (
28,
29).
We expect that disease outcomes and clinical manifestations are a combination of the interaction between the parasite, patient, environmental factors, and treatment. While no study has investigated the effects of human genetic variation on disease severity or mortality, several studies have evaluated the effects of human genetic variation on both visceral and cutaneous leishmaniasis susceptibility using GWAS (
17).
The VL analysis identified a single locus with high significance; the human leukocyte antigen (HLA) haplotypes in the HLA-DR-DQ region (
P = 2.76 × 10
−17) were the only genetic determinant of susceptibility to VL (
17) that was consistent between populations in Brazil and India. Much weaker association hits (top hits at a
P value of <5 × 10
−5) were observed for susceptibility to cutaneous leishmaniasis (CL). Our analysis of parasite genetic variants is entirely consistent with these results, if we consider that host-parasite interactions determine clinical severity. The most striking result of our analysis is the magnitude of the contribution from the parasite genotype that we detected, without considering the human genotype. We consider this result reasonable from two perspectives. First, the effects of human genotypes on disease severity have not been examined directly (only disease susceptibility), and second, it is the parasite that causes disease, while host genotypes merely react to, and modify, disease severity.
Parasite-derived drug resistance and drug pharmacokinetics are also important factors (
30). Genome-scale approaches have identified loci in
L. donovani that are associated with antimonial resistance, which affect clinical outcomes (
31,
32). Similarly, the low efficacy of miltefosine in Brazil, compared to initial efficacies in the Indian subcontinent, has been associated with the deletion of the miltefosine sensitivity locus (MSL), which that has been detected only in
L. infantum isolates from Brazil (
10). In our analysis, we did not observe any effects of MSL gene deletions on mortality or any other trait (GWAS
P values for mortality for all MSL genes > 1 × 10
−2), consistent with MSL being related to miltefosine resistance rather than mortality. The lack of association between the MSL locus and mortality is expected for our data set, as none of the evaluated patients were treated with miltefosine.
We consider it unlikely that the associations of mortality and sex with parasite genotype described here are due to drug resistance. This cohort is not ideal for such an analysis, since patients were treated with a variety of drugs, depending on disease severity (meglumine antimoniate, amphotericin B, liposomal amphotericin B, or pentamidine) (
Table S3). Deaths from VL usually happen in the first week of admission. Drug treatments, even when resistance is present, are effective at this point, and resistance is noted only weeks after the beginning of the therapy. At present, no clear relationship between treatment failure and susceptibility to these drugs has been demonstrated in Brazil. Preliminary
in vitro susceptibility analysis with some of the
L. infantum isolates examined here confirms the lack of association between patient mortality and parasite drug susceptibility (M. Lima, unpublished data). However, host sex is implicated in VL incidence (
33,
34) and increased parasitic load (
35) but not in host mortality (
4,
5). Therefore, the links of
L. infantum genome to mortality and sex seems to follow distinct pathogenic pathways.
Our GWAS analysis indicated that a variety of different genetic variants were associated with clinical outcomes (
Table 2). We discuss the potential implications of each observed SNP/indel and GCNV in supplemental texts B and C (
https://figshare.com/projects/Brazil-VL-GWAS/126331), respectively. While we estimate that the parasite genotype has a strong influence on the chance of mortality (
h2 = 0.83 ± 0.17; 95% confidence interval, 0.66 to 1.00), and some proportion of this effect appears to be derived from SNPs (
Fig. 2), we did not observe any SNPs that were significantly associated with mortality (
Fig. 3). This is possibly due to mortality being influenced by multiple alleles, each with small effects. While the significance values in our GWAS analysis appear fairly modest compared to GWAS analysis of human genetic variants that affect VL susceptibility (
17), several considerations need to be taken into account. First, we have a smaller sample size (
n = 109), so we cannot expect such strongly significant associations from a GWAS. Our results remain significant after permutation, because the
Leishmania genome is smaller than human/dog genomes, which reduces the burden of multiple test correction. Nevertheless, some of our associations are very strong, particularly when covariate analysis is carried out. The gene LINF_340051500 was supported by a robust
P value of 6.4 × 10
−9 when Kala-Cal** was used as a covariate. Similarly, the genes LINF_270006900 and LINF_330042400 were supported when each trait besides Kala-Cal* and Kala-Cal** was simultaneously used as a covariate (
P values of 5.9 × 10
−7 and 1.19 × 10
−6, respectively) (
Table S9), and LINF_270006900 was also supported in all cases when each other trait was individually used as a covariate (
P values ranging from 8.08 × 10
−7 to 1.25 × 10
−8) (
Table S9).
Many of the variants associated with clinical outcomes defy mechanistic explanation. This is unsurprising given the poor state of basic knowledge about
Leishmania-host interactions. Intergenic copy variants are particularly challenging to explain mechanistically, as the function(s) of intergenic regions in
Leishmania genomes is not well understood. However, intergenic regions in the human genome and yeast genomes are frequently associated with phenotypes, consistent with known functional elements within intergenic regions of these species (
29,
36).
We found no associations between chromosome number (CCNVs) and phenotypic traits. This is in accordance with previous work suggesting that karyotypic levels change rapidly between insect and mammalian hosts and also when parasites are cultured, while GCNVs are drivers of long-term adaptation in the field (
26). As the DNA from the parasites was obtained after only two passages, and significant changes in parasite virulence due to culturing are observed only after 20 to 30 passages (
37), we believe that SNPs and GCNV alterations due to culturing would be minimal. Furthermore, even if small (SNP/indel) variations are altered within cultured parasites, such variations were previously associated with a reduction in virulence, which is restored with passages in mice (
37,
38). Hence, genetic changes with parasite culture would be expected to reduce the strength of our GWAS/heritability associations between parasite genotype and clinical outcomes, rather than create artifactual ones. The strong relationships between parasite genotype and clinical factors observed in the present work indicate that the genome data obtained from cultured parasites relate meaningfully to outcomes.
We envisage several implications of our results. Most importantly, the strong influence of parasite genetic variation on mortality indicates that
L. infantum isolates vary in their virulence, so the risk of mortality depends at least in part on the isolate that causes the infection. Variation in parasite virulence within Brazilian
L. infantum isolates may explain the range of clinical VL symptoms, from symptomatic infections (
39), to rare occurrences of highly virulent infections that result in a high fatality rate for VL in Brazil (
40). While it may be possible to use parasite genotypes as early predictors of severe infections, this is unlikely to be economically viable or practically feasible at present, given the complexity of the genotypes associated with mortality and the requirement for parasite culture. It is more practical to use clinical data for risk assessments, which are well-powered and feasible in resource-poor settings (
5) (see Fig. S14 at
https://figshare.com/projects/Brazil-VL-GWAS/126331). However, studies of this type will lead to an enhanced understanding of the molecular mechanisms that lead to severe VL disease. The pathway to this understanding will be a combination of assessing genetic markers, the parasite genes that are implicated, and the host genes enrolled in the immune responses that the different alleles elicit. As many
Leishmania genes lack detailed gene functional annotations, laboratory analysis will be required to obtain mechanistic insights.
ACKNOWLEDGMENTS
This project was undertaken on the Viking Cluster, which is a high-performance compute facility provided by the University of York. We are grateful for computational support from the University of York High Performance Computing service, Viking, and the Research Computing team. We also thank The Genomics & Bioinformatics Laboratory at the University of York for assistance with genome sequencing. We thank Natan Portella Tropical Disease Institute for providing access to the patients and clinical data.
We state that no conflicts of interest exist.
C.A.G. performed genomics and analysis, contributed to writing of the manuscript, and verified the underlying data. K.S.S.C. performed isolate culture and DNA extraction for 65 L. infantum isolates and obtained genomic sequences. M.I.S.L. performed isolate culture and DNA extraction for 14 Brazil samples and phenotypic and genetics analysis, verified the underlying data, and contributed to writing of the manuscript. V.C.S. performed isolate culture and DNA extraction for 30 Brazil samples, obtained genomic sequences, and took care of the parasite collection. J.L.R.-C. performed ploidy and CNV analysis, contributed to writing of the manuscript, and verified the underlying data. M.J.B. performed ploidy and CNV analysis and contributed to writing of the manuscript. S.F. contributed to the variant calling pipeline. C.d.M.P.e.S.d.A. performed sample collection and clinical diagnosis, treatment, and follow-up of patients. D.L.C. contributed to collecting and gathering patient clinical data. J.C.M. obtained funding, contributed to study design, and contributed to writing of the manuscript. D.S. provided support and advice with heritability and GWAS. D.C.J. obtained funding, developed study design, performed analysis, verified the underlying data, and contributed to writing of the manuscript. C.H.N.C. obtained funding, conceived the study, verified the underlying data, and contributed to study design, collection of patient clinical data and isolates, and writing of the manuscript.
We thank the Brazilian Legislators that provided funds for the Ministry of Health destined for this study. C.A.G. was supported by MRC Newton as a component of the UK:Brazil Joint Centre Partnership in Leishmaniasis to J.C.M. (MR/S019472/1). J.L.R.-C. and D.C.J. are supported by a MRC New Investigator Research Grant to D.C.J. (MR/T016019/1). S.F. was supported by a Wellcome Trust Seed Award in Science to D.C.J. (208965/Z/17/Z). M.I.S.L. and C.d.M.P.e.S.d.A. are supported by Coordination for the Improvement of Higher Education Personnel (CAPES; Finance Code 001 and PROCAD-Amazônia 001-21/2018) and Foundation for Research and Scientific and Technological Development of Maranhão (FAPEMA). C.H.N.C. was supported by the National Council of Scientific and Technological Development (CNPq). The Brazilian Ministry of Education supported this research.