Aspergillus fumigatus is a saprotrophic mold fungus commonly found in soil enriched with organic material (
49). This fungus produces huge numbers of airborne spores, which are found ubiquitously in the environment, including the air column (
15). Despite having its primary niche in organic material in soil,
A. fumigatus has become a major pathogenic organism in humans, coincident with compromised or suppressed host immunity (
6). In the immunocompromised host, inhaled spores can initiate serious invasive aspergillosis, a condition carrying a prognosis of at least 50% mortality even when antifungal drugs are administered (
29). Vulnerable individuals cannot be completely protected from airborne spores; even HEPA-filtered air in bone marrow transplant units may contain four
Aspergillus spores per m
3, which may be brought in by the patient or staff (
28). Ascertaining relationships between the epidemiology of aspergillosis and the population structure of
A. fumigatus is therefore desirable, and attempts have been made using a variety of genotyping methods, although with little success in discerning population structure within the species (
52).
A search of the incompletely sequenced
A. fumigatus genome revealed some of the genetic elements that may permit a sexual cycle (
38). The sexual genetic elements in
A. fumigatus constitute a putative heterothallic system. The original genome-sequencing strain AF293 has the high-mobility-group domain protein at the
MAT locus (
36). A survey of 290 isolates revealed distributions of strains comprising either the high-mobility-group protein (
MAT1-
2) or the complementary alpha box domain protein (
MAT1-
1) at the
MAT locus at either 57% or 43%, respectively (
37). In the present study, we describe the development of a seven-locus MLST scheme for
A. fumigatus and discuss the findings from MLST analysis of 100 clinical and environmental isolates. In addition, we have determined the distribution of sexual idiomorphs among our collection. The results show a low level of sequence variation between most isolates of
A. fumigatus, suggesting a relatively recent evolutionary origin for the species.
DISCUSSION
The publication of the
A. fumigatus genome sequence (
36) is a driver for molecular genetic studies and has already helped correct taxonomic assignments based on phenotypic traits. Phylogenetic analysis revealed a sibling species of
A. fumigatus, designated
A. lentulus, in a study that compared sequences at five loci in slow-sporulating variants with those of Af293 (
2), the
A. fumigatus strain used for whole-genome sequencing (
36). Sequence comparison based on part of the β-tubulin gene and 18S rRNA revealed that phenotypically atypical strains previously thought to be
A. fumigatus may be a separate, more recently evolved species (
24). Cryptic speciation within
A. fumigatus was revealed following a microsatellite analysis of 63 isolates (
39). In a recent study, misidentified
A. fumigatus isolates were reclassified as
A. lentulus and
A. udagawae on the basis of restriction fragment length polymorphisms (RFLP) (
3). None of our 100 isolates was slow to form conidia, the main phenotypic differentiator for
A. lentulus and
A. udagawae, and PCR/StyI testing confirmed that the isolates were
A. fumigatus. In our experience with MLST analysis of many fungal species, we have never encountered examples in which isolates of another species gave identical PCR products with all the MLST genes when set up for sequencing reactions. This information therefore serves as a double check on species identity.
Comparison of nucleotide sequences of organisms is the most unequivocal method by which strains of any microorganism can be differentiated (
50). We therefore devised a seven-gene MLST scheme to genotype
A. fumigatus. Housekeeping genes possessed very low numbers of polymorphisms in this species and therefore were not useful for designing an MLST scheme. The infrequency with which we found polymorphisms within many genes scattered throughout the genome is a general indicator that
A. fumigatus has low interstrain variation in its genome, suggesting recent evolution relative to that of other fungal species. MLST is, in effect, the successor to multilocus enzyme electrophoresis (MLEE). MLEE has previously been applied to
A. fumigatus. Rodriguez et al. (
42) found 48 electrophoretic types from 91 isolates tested with 12 polymorphic loci. Bertout et al. (
5) differentiated eight electrophoretic types from 50 isolates based on seven loci. These MLEE results therefore support our finding from MLST that the level of interstrain variation for coding regions of genes is low. Sequence-specific DNA primer analysis distinguished 22 genotypes from 51 isolates, with a discriminatory power of 0.96 (
34), and another sequence-specific DNA primer study found 19 genotypes in 81 isolates (
46). Once again, typing systems based on sequence differentials failed to show high discriminatory power with
A. fumigatus.
The total number of polymorphic sites from all seven MLST fragments in our system was 41 (Table
3); therefore, 1.35% of the 3,038 nucleotides from seven MLST fragments have polymorphisms. This “SNP return” is much lower than that for MLST with other pathogenic fungi, such as
C. albicans, presently with 172 SNPs (6.0%) among the 2,883 bases sequenced;
C. tropicalis, with 169 SNPs (6.3%) among the 2,677 bases sequenced; and
C. glabrata, with 122 SNPs (3.7%) among the 3,345 bases sequenced (results from our own current databases for these species). The 1.35% SNP return for
A. fumigatus is comparable with the polymorphic site rate of 1.6% reported in a study that compared three intergenic sequences in strains of
A. fumigatus and the closely related taxa
Neosartorya fischeri and
Neosartorya spinosa, both of which fall within the subgenus
Fumigati subgroup
Fumigati (
44). Normally, there is an expectation that intergenic sequences should have greater variability than the coding regions which constitute the majority of our MLST scheme. Rydholm et al. (
44) found that the two
Neosartorya species had greater interstrain variation than
A. fumigatus. The consistently low typeability of
A. fumigatus based on DNA sequences of coding and intergenic regions, combined with the observation that 93% of the isolates in our panel fell into a single clonal cluster by eBURST analysis (Fig.
2), reemphasizes the conclusion that
A. fumigatus must be a recently evolved species and has therefore accumulated fewer mutations.
The fragments of approximately 500 bp used for MLST were chosen primarily because they contained the highest possible numbers of SNPs. They do not represent entire open reading frames (ORFs) and, in the present study, include introns and noncoding regions upstream of ORFs. For
CAT1, Calera et al. previously described five introns in the ORF, with sizes of 66, 49, 85, 56, and 59 bp (from the 5′ to the 3′ end) (
9). Our MLST fragment includes the first two of these introns, but the sizes were 66 and 105 bp, as deduced by comparison of GenBank mRNA data (sequence XM 743457) with our own sequence data. Similarly, for
BGT1, the first of three introns previously described for this gene (
35) was included in our MLST fragment, but comparison of the GenBank mRNA sequence (XM 747418) with our own data indicated an intron of 71 bp instead of the 42 bp originally reported (
35).
Earlier approaches to
A. fumigatus strain differentiation included immunoblot fingerprinting (
8), RFLP typing (
10), randomly amplified polymorphic DNA typing (
31,
33,
40), and DNA fingerprinting by Southern blot analysis and probe hybridization (
18). The discriminatory power of each of these approaches was superior to that of our MLST, but in none of the studies did the isolate population yield more than two-thirds of its number as different types. (MLST and its analogous approaches yielded different types amounting to approximately one-third of the number of isolates tested.) By comparison,
C. albicans MLST generated 351 strain types from a panel of 416 isolates, so 84% of the test population emerged as different types. The discriminatory power of typing by amplified fragment length polymorphisms was not assessed (
53), but the data suggest that this approach is at least as effective as RFLP or randomly amplified polymorphic DNA typing. By far, the most successful typing approach for
A. fumigatus has been microsatellite, or short tandem repeat, analysis. This method has been used successfully by several investigators (
4,
14,
19,
43,
51) and has been shown to have a discriminatory power of 0.99 and better (
4,
14).
To achieve finer discrimination of
A. fumigatus strains, some authors have used different methods in combination (
1,
42,
45). However, applying multiple tests makes the task of typing
A. fumigatus more laborious, and some typing methods may be compromised in their accuracy, for example, because interpretation of band sizes is open to error. The advantages of MLST as a typing approach are its portability (sequencing can be done anywhere, with identical results for identical DNA samples) and its archiveability (results can be stored in a central web database). Our study shows that MLST is a practical proposition for typing
A. fumigatus, but for high-level strain discrimination, microsatellite typing is considerably more effective. This approach, like MLST, also offers portability and archiveability. However, its application to population genetic studies is limited by homoplasy arising from highly mutable sequences.
The
A. fumigatus genome sequence project has revealed genetic elements for sexual behavior (
36-
38), opening the possibility that sexual recombination may occur. No observations of mating or meiosis have so far been made for
A. fumigatus; therefore, this species will remain classified as an asexual haploid fungus for the time being. The
MAT1-
2 idiomorph was more commonly found than the
MAT1-
1 idiomorph (71% versus 29%) in our panel of isolates. Other studies also found
MAT1-
2 to be the dominant mating type: 57% versus 43% of 290 isolates (
37) and 55% versus 45% of 102 isolates (
44). The predominance of the
MAT1-
2 idiomorph may indicate a shift from the 1:1 ratio that is consistent with sexual reproduction. There is some evidence that recombination has occurred in the past for
A. fumigatus (
37); however, other studies (
44) have found the same lack of variation between isolates from all parts of the world, indicating a predominantly clonal mode of reproduction. Compared with
Candida albicans, also considered a largely clonal species, which generates 18 eBURST clonal clusters out of 416 isolates (
47), there is very strong evidence for
A. fumigatus both as a clonally reproducing organism and as a species of a relatively recent origin.
A major application of any microbial strain typing system is to elucidate the epidemiology of infection. The many clinical forms of aspergillosis result from inhalation of
A. fumigatus from environmental sources, yet research attempting to make positive associations between environmental isolates and isolates infecting patients has not easily made such links, regardless of the typing method used. While some investigators have found examples of indistinguishable
A. fumigatus isolates in some individual patients and their hospital environments (
10,
19,
33), others have reported multiple strain types in the same patient and even the same sample (
10,
11,
33). The best evidence for common types in infecting and environmental isolates comes more from statistical clustering of strain types (
10,
17,
19,
53) than from unequivocal evidence of an indistinguishable strain type in a clinical sample and a sample, say, from the patient's ward. One study even drew the conclusion of a negative association between clinical and environmental isolates (
43). By contrast, two studies found indistinguishable isolates in several patients, a situation compatible with cross-infection with a common strain type (
12,
27); the finding is also more likely to arise in studies based on typing methods with low discriminatory power.
We cross referenced our own data with those from another study, because 18 isolates were genotyped in both studies (
41). MLST could further resolve strains that had previously been classified as the same type, with up to seven allelic differences found (
41). However, there was no cross-association between types determined by the two methods (Fig.
2). We have no explanation for why the results from ribosomal DNA typing do not match those from MLST.
MLST is a useful tool for differentiating isolates in an unambiguous manner. However, the inherent lack of sequence variability between A. fumigatus isolates probably restricts the value of MLST to preliminary screening in situations such as putative endemic outbreaks. High-level strain discrimination for A. fumigatus is better served by microsatellite typing.