INTRODUCTION
Leptospirosis is a sometimes lethal, waterborne infectious disease caused by pathogenic
Leptospira (
1–4). Humans are usually infected via contact with contaminated surface water or damp soil, but occupational exposure in kennels, slaughterhouses, and animal farms also occurs, albeit relatively infrequently. Rodents are notorious “reservoir” hosts, but virtually any terrestrial species, including pets and domesticated animals, can carry and disseminate
Leptospira. Transmission is presumably more common in warm, wet locales, but disease is cosmopolitan with a global burden comparable with cholera and typhoid fever (
5,
6). Epidemics grow more frequent, made worse by changing weather patterns and increased overcrowding in urban slum communities (
5–7). Even so, the scarcity of epidemiological data (well-curated or otherwise), insufficient public health awareness, extensive evidence gaps, and a lack of adequate diagnostic tests that, collectively, artificially depress incidence rates, indicate that the public health impact of leptospirosis is greatly underappreciated. Yet, despite several well-documented limitations, mainly their inability to provide definitive results soon enough for clinical interventions to be effective (
8), antibody-based diagnostic tests (ABTs), e.g., the Microscopic Agglutination Test (MAT) or IgM-based ELISAs remain the norm, only worsening the public health outlook.
Newly developed leptospirosis diagnostic assays have tended to focus instead on antigen and nucleic acid detection. Of the latter, most are nucleic acid amplification-based diagnostic
tests (NAATs) that target various leptospiral genes and are designed to provide an early diagnosis of leptospirosis (
9–14). In practice, many have context-related limitations, particularly a low diagnostic sensitivity when epidemiological information is lacking; therefore, few are generalizable. Leptospirosis NAATs-du-jour target
lipL32 (
15–17), a taxonomically restricted gene found almost exclusively in
Leptospira.
Leptospira alleles are distinctive and are found only in pathogenic species, ergo
lipL32-based assays are specific for medically important
Leptospira. They tend to be remarkably consistent (dynamic range of 10
7 to 10
0 [
15]) but produce somewhat unpredictable results with non-
L. interrogans leptospiral species, owing to the high sequence diversity of the targeted gene. Despite these recognized benefits,
lipL32-based assays are not yet routine in clinical or public health settings. Presumably because the best options employ fluorescent probes, which inflate setup and operational costs, rendering them ill-suited to the low-tech realities of many leptospirosis endemic regions. To expand access, less expensive PCR-based assays have been developed (
18), however, these by their nature suffer from inferior (i.e., higher) detection limits and are tedious compared with qPCR-based alternatives. Regrettably,
lipl32-based NAATs provide little insight with respect to infecting
Leptospira species or serotype. Accordingly, they are best suited to the clinical setting where an early diagnosis is paramount.
Some leptospirosis NAATs allow species identification (ID), e.g.,
OmpL1-PCR (
19), but a universal limitation of these assays has been their inability to provide serotype information. Serovar ID remains both clinically and epidemiologically useful, as some serovars, e.g., Copenhageni (shorthand notation,
svCopenhageni), are especially virulent, and many have preferred reservoirs, e.g.,
svCopenhageni for rats, and
svCanicola and
svHardjo for dogs and cattle, respectively (
1). Reputed “gold standard” approaches for serotyping
Leptospira strains, such as the Cross Adsorption Agglutination Test (CAAT), and its modern equivalent utilizing panels of cross-reacting monoclonal antibodies (MAbs), are complicated, expensive, and impractical. Nucleic Acid Based Serotyping (NABS) methods, such as multi-locus sequence typing (MLST) (
20,
21), multi-locus variable number tandem repeat analysis (MLVA) (
22), and, more recently, core genome multi-locus sequence typing (cgMLST) (
23) have limited clinical utility due to their reliance on culture isolation (as do all current
Leptospira serotyping methods) and are impractical in most epidemiological contexts. Indeed, several NABS-rooted schemes have been proposed and abandoned for varied reasons but primarily their inconsistent performance with non-
L. interrogans species. And while a newer qPCR based MLVA approach employing high resolution melt analysis (HRMA) is promising (
24), its complexity and cost has limited its application (
25). Consequently, development of inexpensive, easily deployable, culture-independent, and antigen-detection tests capable of distinguishing
Leptospira serovars is paramount.
Here, we report on the development and validation of the first rapid affordable group I-specific (RA
gI) NAAT capable of reliably distinguishing
Leptospira serotypes in diverse sample types, obviating requirements for culture isolation. The assay, which we dubbed the leptospirosis 11108 RA
gI (or “rage-one”) test, targets LIC_RS05715 (old locus_tag: LIC_11108 [abbreviated 11108 hereafter])—a distinctive, single-copy, leptospiral core gene encoding a putative oligosaccharide flippase (NCBI protein ID:
AAS69715.1; UniProtKB ID: Q72TB3)—and allows joint inference of
Leptospira species and serovar. The data presented here provide crucial proof-of-concept validation for and should spur development of conceptually related leptospirosis
serotyping NAATs, (i.e., sNAATs).
RESULTS
Detection limit and cross-species amplification of deconvoluted (i.e., species-defined) 11108 primer sets.
For most species, the optimized 11108-assay had a detection limit of 10 GEs with expected Tm regardless of diluent (i.e., PBS or NHS). However, to achieve a similar level of detection with borgpetersenii, and santarosai-specific primer pairs, extended runs (>40 to 45 cycles) were required. Under optimized assay conditions, the detection of the borgpetersenii, and santarosai-specific primers was 102 to 103 GEs.
Serovar resolution via traditional block-based thermal cycling.
As summarized in
Table 1, CFX96-generated T
m’s ranged from 80°C to 85°C. Negative controls and NTCs were invariably negative, confirming specific detection of group I pathogenic
Leptospira. Intra- and inter-run T
m’s were remarkably consistent (
σ = 0.0°C) among replicates, with T
m’s defined for each allele (
Table 1). At the manufacturer-recommended resolution of 0.2°C, nine
interrogans/
kirschneri sequence variants or alleles produced only four T
m’s, ranging from 80.2 to 80.8°C, though all could be differentiated at 0.1°C-resolution (not shown). Nonetheless, several important serovars could be distinguished based on 11108-amplicon derived T
m’s, including
svCopenhageni/
svIcterohaemorrhagiae (preferred hosts = peri-domiciliary rats),
svLai (field mice),
svHardjo-prajitno (cattle), and
svCanicola/
svPomona (dogs and pigs, respectively). T
m’s were species-dependent, as expected based upon species-variable GC%. With
interrogans,
kirschneri, and
noguchii alleles producing T
m’s ranging from 8°C to 81°C;
borgpetersenii, 81.°C to 82.5°C;
weilii, 83°C; and
santarosai, >84°C (
Table 1; Fig. S2); joint inference of species and serotype could be made. Therefore,
interrogans svHardjo subtype
hardjoprajitno and
borgpetersenii svHardjo
bovis could be readily distinguished.
We also analyzed reference strains obtained from Peru and Sri Lanka, where we have conducted prospective field studies. All strains originating from Sri Lanka (svGeyaweera/svWeersinghe, svRatnapura, svCeylonica, and svAlice) could be reliably differentiated from each other (80.6°C, 80.4°C, 81.6°C, 85°C), and from svCopenhageni/svIcterohaemorrhagiae (80.2°C). In contrast, Peruvian reference strains originating from outside Iquitos, specifically those belonging to L. santarosai, produced inconsistent melt peaks, precluding confident Tm assignment, which is likely due to reduced primer annealing and, thus, amplification efficiency, from these (as-yet) unknown sequence variants, suggesting that alternative forward and/or reverse primers might be needed (as was the case for some interrogans [e.g., svManilae] and noguchii strains).
Serovar resolution using high-precision magnetic induction thermal cycling.
Owing to a different design, MIC-derived T
m’s were on average ~1°C higher than those produced by the CFX96 but were also reliable (
σ ≤ 0.05). Statistical comparison of the means showed that the superior precision of the MIC improved allele resolution (up to 100 potential distinct T
m’s at 0.05°C resolution, and 200 at 0.025°C resolution). Accordingly, previously unresolved (but distinct) alleles in
svBataviae,
svCopenhageni
/svIcterohaemorrhagiae,
svCynopteri,
svHardjo-prajitno
/svWolffi,
svMankarso,
svAustralis
/svBratislava
/svDjasiman/
svGrippotyphosa/
svPyrogenes,
svAutumnalis,
svCanicola
/svPomona, and
svManilae were assigned distinctive T
m’s (
Table 1). Overall, all nine
interrogans/
kirschneri sequence variants could be reliably distinguished based upon mean T
m’s (
P < 0.05).
Reliability, accuracy, sensitivity, and specificity of 11108-based assay.
During the 5-year-period of December 2002 to June 2007, 144
Leptospira strains were isolated from humans (50), peri-domiciliary rats (34), livestock (32), and wild animals (28) in Iquitos and surrounding areas. The majority (65.3%) were either
interrogans,
kirschneri, or
noguchii. Fifty (34.7%) were identified as
santarosai, with 27 (or 54%) derived from livestock (i.e., buffaloes, cattle, and pigs) representing 84.4% of all isolates obtained from these sources. In contrast, only 14 human isolates were classified as
santarosai (28%), as were six from wild animals (21.4%), two from rats (5.9%), and none from bats or dogs. To test the reliability and accuracy of our optimized 11108-based NAAT, we analyzed a sample of these isolates, inclusive of all species and Serogroups found in the region. Twenty-three were taxonomically classified as
interrogans, 13 as
santarosai, seven as
L.
noguchii, and two as
L.
kirschneri.
Table 2 summarizes the 11108 qPCR-assay results of all human and rat isolates analyzed, each producing a reliable T
m.
Based on CAAT (and/or PFGE-fingerprinting), 23 interrogans strains belonged to either serogroup Icterohaemorrhagiae (with all but two definitively typed as svCopenhageni) or Canicola (svCanicola, and two closely related but seemingly distinct strains, BEL039 and CBC1203R). All produced consistent Tm’s which were verified by BLAST. Thus far, only two strains isolated in Iquitos, HAI1536 and MOR069, have been shown to have similar 11108 amplicons but different CAAT or PFGE fingerprints among the 19 analyzed that were classified as interrogans/kirschneri/noguchii. Similarly, isolate MMD4803—presumed to be distinct based on PFGE fingerprinting—could not be resolved from either of two HAI1378-derived isolates, nor CBC621. Overall, of the 33 isolates analyzed, only two produced 11108-inferred IDs that disagreed with CAAT- or PFGE-based assignments, indicating correct inferences approximately 94% of the time, regardless of isolation date or source (human, clinical versus mammalian reservoir). Species inferences were reliable, and the assay correctly differentiated alleles/sequence variants 100% of the time. As is the case for most “exotic” serovars, sequences were not publicly available for Bataviae strains, VAR132 (80.4°C), MOR069 (80.6°C), nor Shermani strains, HAI1378(U) and CBC621 (84.8°C), precluding definitive serovar assignment. However, these were all confirmed via BLAST best-hits to be distinct and derived from 11108-amplicon variants, and based upon Tm-defined species-ranges, grouped with other serotypes from the expected species, i.e., noguchii and santarosai, respectively. Taken together these findings indicate that the 11108-assay is highly reliable and capable of producing consistent results with strains isolated several years apart and originating from various hosts. Therefore, although Tm values might vary based upon hardware/software options utilized, our approach consistently distinguished sequence variants.
Pilot experiments using archived culture-positive patient serum-samples from Sri Lanka (12 isolates, sample age range = 3 to 5 years old) and Iquitos (25, sample age range 11 to 15 years old) produced mostly consistent results. Ten of 12 recently archived serum-samples were 11108 qPCR positive (~83% sensitivity). Eight produced consistent amplification (i.e., were positive ≥67% of the time) across replicates and repeat runs (minimum of nine replicates), and two amplified less consistently, being positive <67% of the time. That said, all produced expected T
m’s with respect to their cognate isolate.
Fig. 1 shows melt-curves from six of these archived samples. Amplification of older samples (i.e., 11 to 15 years old) was less reliable, although all were positive with amplification evident in at least one replicate in all runs (i.e., ~33%). And regrettably, most produced indefinite or quasi-stable T
m’s (±0.1°C), though some, e.g., serum-sample PAD451 (81.0°C), produced reliable T
m’s agreeing with CAAT-based classification of its cognate isolate, indicating that further refinement of our assay could allow retrospective analysis of bio banked serum-samples.
Forty-four dog urine samples were initially classified as culture negative, leading to the presumptive identification of 11 false-positive samples (culture negatives that were positive by at least one real-time qPCR assay). Because culture is inefficient (<10% success rate, in our hands), these presumed false positives were verified to contain
Leptospira DNA by shotgun metagenomics sequencing (SMS). All were positive by 11108-based real-time qPCR, of which four were also positive by
lipL32 qPCR (not shown). Partial
Leptospira genomes were reconstructed from all but two, which produced mostly low-quality reads. Of the initial 84 samples, complete data were produced for 80: 47 verified positives and 33 negatives. Assay results were used to construct a contingency table to assess sensitivity and specificity (
Table 3). The sensitivity (true positives detected) was clearly superior to the
lipL32-based assay used for comparison while maintaining 100% specificity.
DISCUSSION
Recognizing the public health importance of real-time serovar ID, we developed a novel single-tube NAAT capable of providing reliable Leptospira serovar predictions directly from diverse sample types. Rigorous validation against dog urine samples proved our assay to be highly specific for group I pathogenic Leptospira, with no detectable off-target amplification of even genetically related (but usually hypovirulent) group II species or of unrelated, poorly characterized microorganisms in complex microbial species-consortia. It dramatically outperformed a mainstream lipL32-based alternative and proved unique in its capacity to provide joint species and serotype IDs, while consistently detecting fewer than 10 genome equivalents/sample in serum and urine. From a public health perspective, the early successes of our approach provide pivotal proof-of-concept validation that should spur development of a newer class of NAATs specifically for the detection and direct identification of Leptospira serovars in varied clinical settings and epidemiological contexts.
The MAT was formerly the gold standard for leptospirosis diagnosis. But, owing to well-documented limitations (and antibody-based diagnosis in general), it has been steadily supplanted by a bevy of NAATs, with
lipL32-based real-time qPCR assays now routine. However, until now, NAATs by design could not generate epidemiologically useful information. Some 16S rRNA-based assays yield species IDs (
18), but these alone provide limited epidemiological information, as many important but distinct serovars belong to the same species, making it nigh impossible to assess the relative importance of potential reservoir species, domesticated or otherwise, in the maintenance and transmission of human leptospirosis. Furthermore, as diagnostic tests, 16S rRNA-based assays, although sensitive, are mostly genus-inclusive (and not specific for pathogenic
Leptospira); they tend to produce false-positive results, and are practically unusable with certain sample types, such as urine (
26). In contrast, our assay and
lipL32-based alternatives are highly specific for pathogenic species, but unlike the latter, ours proved to be as sensitive as those targeting the 16S rRNA gene. Moreover, of the three design philosophies, only our assay yielded joint predictions of infecting species and serovar. Therefore, based on its validated sensitivity and specificity, empirically determined detection limit, and the verified accuracy of its predictions, this new NAAT holds considerable promise for real-time serovar ID, an important milestone for public health leptospirosis surveillance programs.
We also introduce the concept of “sequence-variant allied” or “allele-specific” Tm’s (i.e., melt-temperatures), which we use interchangeably and show that under optimal conditions all unique 11108-amplicon variants could be reliably differentiated by this metric alone. More importantly, assay predictions were reliable (i.e., unexpected Tm’s were all well-supported by SNPs in sequenced amplicons), improving upon the MAT, which is currently the only other diagnostic test permitting inference of infecting serovar.
More importantly, assay predictions were reliable (i.e., unexpected T
m’s were all well-supported by SNPs in sequenced amplicons), improving upon the MAT, which is currently the only other diagnostic test permitting inference of infecting serovar. Sadly though, MAT-based predictions are mostly inaccurate (
27,
28) or ambiguous (
29), yet presumably improved through analysis of paired (acute and convalescent) sera. As far as we know, the two most medically important serovars worldwide,
L. interrogans svCopenhageni and
svCanicola, were easily and reproducibly distinguished, as were others with shared host preferences. Even multispecies serovars, such as
svHardjo
interrogans subtype prajitno and
borgpetersenii subtype bovis, were easily distinguished, as were novel or poorly characterized serovars with little to no genome information, which could be differentiated from each other (and from common serovars) based on T
m, or amplicon-derived sequence data (as was the case with buffalo isolate, CBC621). As more genomes are sequenced and additional sequence variants are identified, further improvements or customizations are not only possible but easily developed. Even so, LIC_11108 is but one of many leptospiral genes we have identified (so far) that have clinical utility as targets of rapid diagnostic and/or prognostic tests.
Our approach is not intended to replace specialized serotyping methods, such as MLST profiling or PFGE fingerprinting, which by design have superior resolving power, but rather to address a clear need for NAATs capable of rapid and reliable serovar ID directly from varied sample types. NAATs that are readily adapted to diverse clinical settings and public health surveillance initiatives. Additionally, in recent years, emphasis has been placed upon bridging the gap between historical knowledge of
Leptospira serological and molecular techniques, with growing concern that crucial epidemiological information will be lost in the transition to modern rapid diagnostics (
30). We believe that our assay (and others adopting similar design principles) could ultimately bridge this gap.
MATERIALS AND METHODS
Primer design and validation.
For real-time qPCR-based serovar identification (ID), genes comprising the core
Leptospira genome were first defined using the Pan-Genome Ortholog Clustering Tool, PanOCT (
31), and those predicted to encode proteins participating in LPS assembly or export, e.g., O-antigen polymerases (
rfbY), O-antigen/polysaccharide flippases,
rfbX and LIC_11108, and the
Leptospira lipopolysaccharide export system (Lpt), including
msbA,
lptA, and
lptD, were used to construct exhaustive amino-acid-guided
multiple
sequence
alignments (MSAs) of coding sequences (CDS) retrieved from GenBank—anticipating that these would be serogroup- and (potentially) serovar-dependent, due to their stereospecificity (
32). LIC_11108 was chosen for initial evaluation based on several criteria (the most notable summarized in the preceding paragraph). From these MSAs, hypervariable regions were identified and targeted for amplification by primers complementary to conserved flanking sequence (Fig. S1).
As none of these flanking regions were 100% conserved, degenerate primers 11108
f: 5′–TT
R AA
N GA
R AGT ATA AAA CTT CC–3′, and 11108
r: 5′–AAC GG
R CT
Y TTT CAA TA
Y GC–3′ (
Table 4)—specific for group I pathogenic
Leptospira—were synthesized. Pilot experiments were conducted using 11108
f and 11108
r to confirm amplification of globally distributed
Leptospira species:
L. interrogans,
L. kirschneri,
L. noguchii,
L. borgpetersenii,
L. weilii, and
L. santarosai. Amplification of
L. mayottensis, which has a more restricted distribution, was assessed via
in silico PCR of available genome sequences. Baseline assay conditions were established using degenerate primers, following which optimal conditions were determined using deconvoluted species-specific primers (
Table 4). Cognizant of our intended longer-term development goals, we opted for an affordable SYB green-based NAAT that could be easily customized for probe-based assays (should the need arise) and evaluated its performance on affordable hardware/software combinations, including a four-channel magnetic induction thermal cycler (MIC) manufactured by Bio Molecular Systems (~40% cost of CFX96).
Detection limit, cross-species amplification of deconvoluted (i.e., species-defined) 11108 primer sets.
Genomic DNA (gDNA) for qPCR (and sequencing) was prepared from interrogans svLai, santarosai svShermani, and borgpetersenii svTarassovi using the manufacturer-recommended extraction protocol for Gram-negative bacteria (GeneJET DNA purification kit, Thermo Fisher Scientific) and then amplified on a CFX96 thermal cycler (Bio-Rad). Amplicons were cloned into pCR-2.1 TOPO to create species-specific reference plasmids, which were transformed into E. coli following manufacturer’s directions (TA Cloning Kit, Invitrogen). After routine minipreps (GeneJET Plasmid DNA Miniprep Kit, Thermo Fisher Scientific), purified (reference) plasmids were quantified (individually) by fluorometry (Qubit, Thermo Fisher Scientific), and their copy number estimated. This was followed by serial dilution in either PBS buffer or heat-inactivated normal human serum (NHS); re-extraction was carried out using the QIAamp DNA Blood minikit (Qiagen) per manufacturer-recommended protocol for serum, and used as template for qPCR.
As 11108 genes occur once per Leptospira genome, copy numbers could be expressed as genome equivalents (GEs). The detection limit was defined as minimum number of GEs producing the expected Tm’s among all technical replicates. Reference plasmids were also used as template in amplifications evaluating the propensity of deconvoluted primer pairs for cross-species amplification.
Serovar resolution via traditional (i.e., block-based) thermal cycling.
Reaction mixes consisted of the following: 10 μL of Precision Melt Supermix, 2 μL of template
gDNA, and 4 μL of both forward and reverse primers, yielding final (empirically optimized) primer concentrations of 0.4 μM per 20 μL reaction. The finalized thermal cycling profile consisted of a 3-min hold at 95°C (denaturation), followed by 44 cycles of 95°C for 10 s, 62°C for 30 s (empirically optimized annealing temperature) and 72°C for 1 min (extension), and a final hold at 72°C for 7 min to ensure full extension of PCR product. Amplification was followed by melt-curve peak identification (via Precision Melt Analysis Software) with fluorescence measured at 0.2°C increments (default). All samples were run in triplicate, and runs were repeated at least three times to produce a minimum of nine data points/per sample. Cq values of ≤ 40 cycles were considered positive, but postcutoff amplification (i.e., Cq > 40) with appropriate melt peaks were recorded, and later verified by sequencing. Milli Q water was run in parallel, in triplicate, as a no template control (NTC).
gDNA from
L.
licerasiae Varillal svVarillal (archetype of group II, comprising low virulence
Leptospira species), the genetically related
L.
wolffii svKhorat, and premixed
gDNA from Gram-negative and Gram-positive bacteria,
Citrobacter rodentium (ATCC DBS 100),
Escherichia coli (EPEC E2348/69),
Clostridium difficile (ATCC BAA-9689), and
Staphylococcus aureus (MARSA, USA 300), were used as negative controls (to assess nonspecific “off-target” amplification). For each serovar, a mean T
m was calculated after first identifying and then removing outlier data points via robust
regression and outlier removal (ROUT) (
33) analysis (Q = 10%, Prism v7.0a). Sequence-variant specific T
m’s were defined via comparison of sample-means by two-tailed, nonparametric
t test (
P < 0.05, Prism v7.0a).
qPCR results were verified by conventional Sanger sequencing of cloned amplification products. Twenty-five randomly chosen colonies were subcultured into LB broth, and plasmids were isolated using a QIAprep Spin Miniprep Kit (Qiagen) and then sent for sequencing. Consensus sequences were aligned using the MAFFT multiple sequence alignment (MSA) algorithm with default parameter settings. Individual consensus sequences were submitted to the Basic Local Alignment Search Tool (BLAST) server to confirm specificity (
34). For serovars consistently yielding inexact or a quasi-stable T
m’s, we sequenced 10 randomly selected clones producing six reads per clone (for a total of 60), and then compared clone-specific consensus sequences to identify variants.
Serovar resolution via high-precision magnetic induction thermal cycling.
Reaction mixes and thermal cycling profile were as described above, however, postamplification melt-curve analysis absorbance readings were taken at increments of 0.025°C, exploiting the superior precision of MIC. Negative controls and NTCs were run in triplicate, in parallel, as described above. And as before, outliers were removed by ROUT analysis before comparison of serovar-associated Tm by two-tailed, nonparametric t test. Data were visualized via a box-and-whisker plot.
Reliability, accuracy, sensitivity, and specificity of 11108-based assay.
To assess assay reliability, amplifications were done using different hardware/software combinations: (i) CFX96 with melt-curve analysis via Precision Melt Software; (ii) CFX96 with default melt-curve analysis (i.e., without Precision Melt Analysis Software); and (iii) MIC with bundled melt-curve analysis software.
To test accuracy (and thus, its potential epidemiological utility), we analyzed 45 Leptospira isolates obtained over a 5-year period from febrile patients, domesticated animals, and rats in Iquitos, Peru (Table S2), and 25 isolates from hospitalized patients in Sri Lanka, assessing agreement among Tm’s, PFGE- and CAAT-derived serovar assignments. Outlier removal, statistical comparisons, and data visualization were as described above. Discrepancies among Tm’s, PFGE and CAAT-based serotyping were resolved by Sanger-sequencing (of amplified products).
Assay sensitivity and specificity were determined using 84 canine urine samples, which comprised 40 culture positives and 44 randomly selected culture negatives and were sourced from a study of canine leptospirosis in Sokoto, Nigeria (
35), as well as 12 culture-positive acute-phase serum-samples from a prospective study of human leptospirosis in Sri Lanka (
36). Because
Leptospira species and serovars in Sokoto and study sites in Sri Lanka are not well-characterized, all samples were amplified using premixed, species-appropriate primers. Shotgun metagenomics sequencing (SMS) data were generated from qPCR-positive, culture-negative samples to identify potential false-positives.
For comparison, dog urine samples were also assayed for the presence of leptospiral DNA by lipL32 qPCR using the following forward and reverse primers: lipL32_178f (5′–TCT GTG ATC AAC TAT TAC GGA TAC–3′) and lipL32_419r (5′–ATC CAA GTA TCA AAC CAA TGT GG–3′), specific for virulent group I pathogenic species. The (previously) empirically optimized lipL32-based thermal cycling profile consisted of the following: 3 min denaturation at 95°C, 45 amplification cycles (95°C for 10 s/62°C for 30 s/72°C for 60 s), 7 min extension at 72°C followed by postamplification melt-peak identification. For qPCR, gDNA from svManilae was used as a positive control and as before, premixed gDNA from four bacterial species was used as a negative control.