The rapid emergence of SARS-CoV-2 variants raised public health questions concerning the capability of diagnostic tests to detect new strains, the efficacy of vaccines, and how to map the geographical distribution of variants to understand transmission patterns and loads on healthcare resources. Next-generation sequencing (NGS) is the primary method for detecting and tracing new variants, but it is expensive, and it can take weeks before sequence data are available in public repositories. This article describes a customizable reverse transcription PCR (RT-PCR)-based genotyping approach which is significantly less expensive, accelerates reporting, and can be implemented in any lab that performs RT-PCR. Specific single-nucleotide polymorphisms (SNPs) and indels were identified which had high positive-percent agreement (PPA) and negative-percent agreement (NPA) compared to NGS for the major genotypes that circulated through September 11, 2021. Using a 48-marker panel, testing on 1,031 retrospective SARS-CoV-2 positive samples yielded a PPA and NPA ranging from 96.3 to 100% and 99.2 to 100%, respectively, for the top 10 most prevalent World Health Organization (WHO) lineages during that time. The effect of reducing the quantity of panel markers was explored, and a 16-marker panel was determined to be nearly as effective as the 48-marker panel at lineage assignment. Responding to the emergence of Omicron, a genotyping panel was developed which distinguishes Delta and Omicron using four highly specific SNPs. The results demonstrate the utility of the condensed panel to rapidly track the growing prevalence of Omicron across the US in December 2021 and January 2022.


Since the beginning of the COVID-19 pandemic, variants have emerged that have the potential to evade vaccines, cause diagnostic test performance issues, or cause more severe disease (15). Monitoring and surveillance of the genetic lineages of SARS-CoV-2-positive samples are critical to the timely identification of emerging variants (68). In November 2021, Omicron, a highly divergent variant harboring multiple mutations, was reported in South Africa (9). While data suggest that Omicron may be associated with reduced risk of hospitalizations compared to Delta, the increased transmissibility and rapid global spread of Omicron highlighted the need for a fast and cost-effective approach for early detection of such variants (1012).
SARS-CoV-2 genetic lineages in the US are primarily monitored by next-generation sequencing (NGS) on a random selection of approximately 5% percent of SARS-CoV-2-positive samples (13, 14). While extremely accurate at detecting existing circulating variants, NGS does not allow for the timely identification of emerging variants. More focused approaches, such as genotyping using single-nucleotide polymorphism (SNP) assays, offer significant advantages in terms of cost, throughput, and efficient result reporting. Several reports have shown that SNP assays provide rapid turnaround times compared to NGS (1517). Harper et al. (18) previously demonstrated the utility of a SNP genotyping panel to inexpensively identify SARS-CoV-2 genotypes circulating between March and May 2020. However, previous methods required a two-step approach to genotype a sample: (i) identify a positive sample using reverse transcription PCR (RT-PCR) or an antigen assay; and (ii) genotype the positive sample using a SNP panel. The documented approach seeks to expand upon earlier efforts by developing an assay which simultaneously confirms a SARS-CoV-2-positive sample and identifies its genotype. SARS-CoV-2-positive samples with undetermined genotyping variant classification could provide a prescreened sample set for NGS and potentially allow for early identification of emerging variants.
The National Institutes of Health (NIH) Rapid Acceleration of Diagnostics (RADx) initiative created a Variant Task Force (VTF) in January 2021 to assess the impact of emerging SARS-CoV-2 variants on in vitro diagnostic testing (19). In July 2021, the NIH RADx VTF also initiated an effort to develop a SARS-CoV-2 RT-PCR assay for variant agnostic detection of SARS-CoV-2, as well as early detection and monitoring of SARS-CoV-2 variants. Its aims were as follows: (i) identify SARS-CoV-2 markers useful for the detection of SARS-CoV-2-positive samples across all variants; (ii) develop a panel of SNP markers that can be used to accurately assign lineages to SARS-CoV-2-positive samples; and (iii) implement a genotyping approach and platform for the early detection of new and re-emerging variants which signals when markers need to be updated.
This paper outlines the results of the performance validation of the initial genotyping assay and associated marker sets. It also describes how this approach was rapidly adapted to develop a targeted panel of four mutations—three for Omicron and one for Delta—for the purpose of identifying Omicron, and how this Omicron genotyping panel was implemented in several diagnostic labs. The data from this study, as well as associated statistics and trends, are available on a publicly accessible dashboard (20).


Marker selection.

Data analysis for identifying SARS-CoV-2 markers was performed using the Variant Analysis for Diagnostic Monitoring system (ROSALIND, San Diego, CA). Genome sequences and metadata used for the selection of markers in this study were obtained through a Direct Connectivity Agreement for complete daily worldwide downloads from the GISAID EpiCov database (2123). Sequences not tagged with “is complete” and sequences with an “n_content” of more than 0.05 were excluded. Pairwise whole-genome alignments of all sequences were performed using LASTZ v1.04.02 with the National Center for Biotechnology Information (NCBI) Reference Sequence: NC_045512.2 as the SARS-CoV-2 reference genome (24, 25). The Bioconductor package for genetic variants, VariantAnnotation v1.20.2, was then used for the translation into amino acids in R v3.3.2, and the identification of amino acid substitutions or frameshifts was used to call a unique mutation incident (26, 27).
Selection of the lineages considered for the marker panel was performed by combining the top 100 most frequently reported lineages worldwide for the 120-day period between May 12 and September 11, 2021 (data not shown). A total of 1,200,791 sequences representing 393 lineages were analyzed. The top 10 most unique mutations for each World Health Organization (WHO) label were then identified, and multiple combinations of these unique mutations were evaluated to classify a viral sequence into a WHO label with at least 90% overall accuracy. Additional mutations were added to ensure coverage for the Centers for Disease Control and Prevention (CDC) Variants Being Monitored (VBM), Variants of Interest (VOI), Variants of Concern (VOC), and Variants of High Consequence (VOHC).
The positive-percent agreement (PPA) and negative-percent agreement (NPA) for each marker set compared to NGS was calculated according to the Clinical and Laboratory Standards Institute (CLSI) EP12-A2: User Protocol for Evaluation of Qualitative Test Performance (28). A classifier algorithm was developed to measure the presence, absence, and combinations of mutations to accurately assign the WHO label classification. A dedicated system was established to host the classifier algorithm and provide a web application with application programming interface capabilities for standardized data submission and processing (29). This system was established on a secure virtual private cloud instance on the Google Cloud Platform with the ability to process thousands of specimens per minute.


The 1,128 retrospective clinical samples (1,031 SARS-CoV-2-positive with sequences in GISAID and 97 SARS-CoV-2-negative) used in this investigation were collected from April through December 2021 by Helix OpCo and the University of Washington (UW). The 26,788 prospective clinical samples (all previously determined SARS-CoV-2-positive and including 11,849 with sequence information used to validate the 4-marker panel) were collected from November 2021 through January 2022 by Aegis Sciences Corporation, Helix OpCo, and UW. All clinical samples, retrospective and prospective, were collected predominantly in the states of Washington, Florida, and California. Aegis Sciences Corporation, Helix OpCo, and UW are Clinical Laboratory Improvement Amendments (CLIA)-certified labs participating in the CDC National SARS-CoV-2 Strain Surveillance sequencing program to monitor variant distribution in the US (30). The Pearl independent institutional review board (IRB) gave ethical approval for the use of Aegis Sciences Corporation de-identified remnants of clinical testing. Western Institutional Review Board–Copernicus Group, the institutional review board of record for the Helix Respiratory Registry, gave ethical approval for the use of Helix OpCo de-identified remnants of clinical testing. Use of the UW de-identified excess clinical specimens was approved with a consent waiver by the UW IRB.

Genotyping assay.

A panel of single well SNP assays, each containing two primers and a duplex of VIC and FAM dye-labeled probes detecting the reference and mutant alleles, respectively, was developed using previously described methods (31). Primers were selected based on mapping to genome regions with a mutation frequency of less than 1%, ensuring that no major polymorphisms interfered with the primers. Primer sets were designed such that amplicon sizes were below 150 bp. Minor groove binder probes were designed to achieve optimal discrimination between the two alleles by taking the position, nucleotide composition, melting temperature (Tm), and the type of allele into consideration. The Tm of the primers ranged from 59 to 62°C and the Tm of the probes ranged from 59 to 65°C. Viral RNA was automatically extracted using the MagMAX Viral/Pathogen II Nucleic Acid isolation kit (Thermo Fisher Scientific, South San Francisco, CA). Within 24 h of extraction, reverse transcription quantitative PCR (RT-qPCR) using the selected panel was performed using the TaqPath 1-Step RT-qPCR Master Mix, CG (Thermo Fisher Scientific) on a QuantStudio 7 Real-Time PCR System or ProFlex 2 × 384-well PCR System (Thermo Fisher Scientific) followed by endpoint data collection using the QuantStudio 7 Real-Time PCR System. Data were analyzed using the TaqMan Genotyper v1.6 software (Thermo Fisher Scientific). The software algorithm used the normalized reported emission (Rn) of VIC (x axis) versus the Rn of FAM (y axis) from amplification of the reference and mutant alleles to obtain genotype calls in samples with as few as 10 RNA copies per reaction. All the materials required to run this genotyping assay—instruments, consumables, reagents, software, and specific assays for each of the markers—are available commercially (32, 33).

Next-generation sequencing.

The SARS-CoV-2 sequencing and consensus sequence generation methods used in this study have been previously described (34, 35).


Complete marker panel.

The complete genotyping assay panel consisted of 45 lineage specific markers and 3 positivity markers (Table 1).
TABLE 1 Set configurations for 48-, 24-, 16-, 12-, and 8-marker setsa
Nucleotide mutationsAA mutationMarker setClassification outcome
nsp10 geneNone+++++SARS-CoV-2 detected
N gene SC2None+    
G22018TS:W152C++++ Epsilon
A16500Corf1b:Q1011H++++ Iota
T22917AS:L452Q++++ Lambda
A11456Gorf1ab:I3731V+++  Delta
A28699GNone+++  Eta
G23593CS:Q677H+++  Eta
A24775TS:Q1071H+++  Kappa
TACATG21765—S:HV69–++   Alpha
TTA21991—S:Y144-++   Alpha (when combined with T16176C)
CTTTACTTG22281—S:LLA241—++   Beta (when combined with A21801C)
T733CNone++   Gamma
T22917GS:L452R++   Delta (or Epsilon when combined with G22018T)
A22320GS:D253G++   Iota (when combined with A16500C)
G23012CS:E484Q++   Kappa (when combined with A24775T)
C27925AORF8:T11K++   Mu
G22132TS:R190S+    Gamma
C23604GS:P681R+    Delta
C25469TORF3a:S26L+    Delta
G5629TNone+    CDC VBM, VOI, VOC, or VOHC
CDC, Centers for Disease Control and Prevention; VBM, Variant Being Monitored; VOI, Variant of Interest; VOC, Variant of Concern; VOHC, Variant of High Consequence.

Variant agnostic positivity markers.

The 3 variant agnostic markers selected to detect SARS-CoV-2 positivity were as follows: (i) the S gene: D614G mutation, a nonsynonymous mutation present in nearly all SARS-CoV-2 sequences which results in the replacement of aspartic acid with glycine at position 614 of the viral spike protein; (ii) a conserved sequence in nsp10 (nucleotides 13,025 to 13,441); and (iii) a conserved sequence identified by the CDC in the N gene SC2 region (nucleotides 29,461 to 29,482) (36).
A total of 1,128 retrospective samples (1,031 SARS-CoV-2-positive with sequences in GISAID and 97 SARS-CoV-2-negative) collected between April and December 2021 were evaluated using the variant agnostic positivity markers (Table 2). The combined markers were detected in all but 7 of the 1,031 SARS-CoV-2-positive samples. The quantification cycle (Cq) range for these 7 positive samples was 22 to 33. The PPA using any combination of 2 or more markers was greater than or equal to 98.9% with the criteria being that 1 marker detected was enough to make a positive call. Additionally, the PPA using 1 marker was greater than or equal to 96%. There were no false-positive results (data not shown).
TABLE 2 Variant agnostic positivity markers in vitro performance
No. of markersVariant agnostic markerPositive calls (n)aPPA (%)
S:D614Gnsp10 geneN gene SC2
2++ 1,02098.9
+ +1,02199
1+  99396.3
 + 1,01898.7
0   70.7
Total SARS-CoV-2-positive samples: 1,031.

Lineage assignment.

The performance of the genotyping assay panel and the associated classifier was determined by in silico and in vitro studies with retrospectively collected SARS-CoV-2 specimens.
A bioinformatics simulation was performed using GISAID SARS-CoV-2 sequence data from the first week of each month from November 2020 to October 2021. A total of 323,148 GISAID sequences were analyzed. With the 48-marker set, for the top 10 most prevalent WHO lineages as of September 11, 2021 (Alpha, Beta, Gamma, Delta, Epsilon, Eta, Iota, Kappa, Lambda, and Mu), simulated PPA ranged from 80.7% to 99.9% and simulated NPA ranged from 98.1% to 100% (Table 3). The performance for the Kappa variant was impacted by reporting from Asia and Oceania, where many Kappa-positive samples were misclassified as Delta.
TABLE 3 Performance of 48-marker set in silico classifiera
Lineage48 Markers
Simulated PPA (%)Simulated NPA (%)
PPA, positive percent agreement; NPA, negative percent agreement.
We next determined the clinical performance of the genotyping assay and classifier. The 1,031 SARS-CoV-2-positive samples were genotyped and classified with the 48 markers shown in Table 1. The classifications were then compared to the Phylogenetic Assignment of Named Global Outbreak Lineages (Pango) assignment based on the whole-genome sequences in the GISAID database (Table 4) (37). For the top 10 WHO lineages, the PPA ranged from 96.3% to 100% and the NPA ranged from 99.2% to 100%. The classifier categorized an additional 78 samples as undetermined (data not shown). Pango assigned 77 of these samples to 14 lineages for which the genotyping assay does not include specific markers (Zeta, B.1, B.1.1.507, B.1.2, B.1.221, B.1.241, B.1.517, B.1.596, B.1.609, B.1.625, B.1.628, B.1.634, B.1.637, and C.36.3), and did not classify 1 of these samples.
TABLE 4 Performance of 48-marker set in vitro classifier
Pango assignmentResultClassifier call (n)Percent agreement (%)a
Positive or negative percent agreement.

Marker reduction.

To optimize assay performance in terms of sample input, reductions of the 48-marker panel were explored. We assessed the performance of 24-, 16-, 12-, and 8-marker sets which were defined based on mutation combination performance and targeted lineage prevalence during the 120-day period between May 12 and September 11, 2021 (Table 5). Each of the panels also included 2 of the variant agnostic positivity markers (nsp10 gene and S:D614G), which were used as assay internal controls. The 48-, 24-, and 16-marker sets identified the top 10 WHO lineages, while the 12- and 8-marker sets identified 8 and 6 of the top 10 WHO lineages, respectively.
TABLE 5 Performance of 48-, 24-, 16-, 12-, and 8-marker set in vitro classifiersa
Classifier call48 markers24 markers16 markers12 markers8 markers
PPA (%)NPA (%)PPA (%)NPA (%)PPA (%)NPA (%)PPA (%)NPA (%)PPA (%)NPA (%)
PPA, positive percent agreement; NPA, negative percent agreement.
Cannot call.
The identification of samples that could not be determined by the classifier was further investigated. The number of undetermined samples for each marker set are shown in Fig. 1. The percentage of undetermined samples ranged from approximately 7% to 11% for the 12- to 48-marker sets and increased dramatically to 27% with the 8-marker panel. The shift from a 48- to 8-marker set is associated with 6-fold increased throughput and 6-fold cost reduction, assuming a fixed cost per single-well SNP assay.
FIG 1 Number of undetermined calls in in vitro classifier performance analysis.

Early detection of new variants.

One of the aims of this study was to develop a genotyping approach for the early detection of emerging variants. An increase in the number of undetermined calls by the classifier provides a signal for focused sequencing of those samples, potentially allowing early detection of new variants. To test this hypothesis, a bioinformatics simulation was performed using a modification of the 12-marker panel. The 2 Delta-specific markers were removed to simulate what would have been observed before and during the emergence of the Delta variant. The number of undetermined calls in the first week of each month from March through July 2021 is shown in Fig. 2. Panel A. The 12-marker set without the 2 Delta-specific markers was able to assign lineages to all positive samples in GISAID for North America in November and December 2020 (data not shown). The numbers of undetermined calls were 5 of 16,901, 7 of 17,542, and 1 of 21,869 in January 2021, February 2021, and March 2021, respectively (all less than 0.05%). In April 2021, the number increased to 51 of 40,398 (0.1%), followed by a rapid increase over the following 3 months to 12,825 of 16,615 (77.2%) undetermined calls in July 2021. Then, results were compared to the average daily Delta prevalence in the USfrom March to July 2021 as reported by the CDC (Fig. 2B). The prevalence data for the emerging Delta variant mirrors the rate of increase in undetermined calls over the same period.
FIG 2 Undetermined calls using 12-marker set without Delta-specific markers (A) compared to average daily Delta prevalence in the US from March through July 2021 (B). Panel B image courtesy of outbreak.info (40).

Addition of Omicron markers.

In response to the emergence of the Omicron variant in November 2021 in South Africa, an assay was rapidly developed containing markers specific for this variant. Sequence analysis of the first 132 Omicron sequences revealed 3 markers—ORF1ab:A2710T, ORF1ab:T13195C, and S:T547K—which were found in high percentages of these sequences. Based on in silico modeling, there was greater than 99% concurrence between the Pango assignment based on the GISAID sequence and the combined 3 markers (data not shown). Subsequently, a genotyping assay consisting of the 3 Omicron-specific markers and 1 Delta-specific marker (S:T19R) was developed. Variant agnostic positivity markers were excluded from the 4-marker set evaluated in this study using previously determined SARS-CoV-2-positive samples but could be readily added for simultaneous interrogation of positivity and genotyping.
A total of 11,849 previously determined SARS-CoV-2-positive samples were collected and genotyped between November 2021 and January 2022 (Table 6). Sequencing confirmed that these samples consisted of 8,870 Omicron, 2,630 Delta, and 105 samples from 12 other lineages (A, B, B.1, B.1.1, B.1.1.161, B.1.1.26, B.1.1.305, B.1.1.519, B.1.350, B.1.551, B.1.609, and B.3.1), as well as 244 samples that were not classified by Pango and thus excluded from analysis. The 4-marker panel for Omicron genotyping correctly identified 99.8% of the Omicron samples and 93.8% of the Delta samples. The 4-marker panel incorrectly identified 7 Delta samples as Omicron and classified 156 Delta samples as undetermined. The panel also classified 14 Omicron samples as undetermined. The 105 samples representing other lineages were classified as 5 Delta, 92 Omicron, and 8 undetermined.
TABLE 6 Performance of 4-marker set in vitro classifier
Classifier callResultPango assignmentPercent agreementa
Positive or negative percent agreement.

Prevalence of Omicron in the US in December 2021 and January 2022.

We next deployed the 4-marker panel in three CLIA-certified labs and genotyped an additional 14,939 previously determined SARS-CoV-2-positive samples, for a total of 26,788 SARS-CoV-2-positive samples collected between November 2021 and January 2022. Using the 4-marker panel, it was determined that the relative prevalence of the Omicron variant grew from approximately 40% on December 13, 2021, to more than 90% 2 weeks later on December 27, 2021, while Delta decreased from approximately 60% to less than 10% over the same period. By January 18, 2022, the prevalence of the Omicron variant further increased to over 95% and Delta further decreased to less than 5% (Fig. 3).
FIG 3 Four-marker set in vitro Delta and Omicron classifier calls.


The SARS-CoV-2 virus continues to mutate at an unprecedented scale. NGS is the main method used to track the emergence of new variants; however, NGS technology is expensive and current reporting to GISAID and national regulatory authorities typically takes several weeks (1517). As of January 27, 2022, there were 7,457,886 sequences in GISAID. Nearly 80% of these sequences were deposited by 10 countries (US, United Kingdom, Germany, Denmark, Canada, France, Japan, Sweden, Switzerland, and India), and 90% by two continents (Europe and North America) (2123). With approximately 400 million COVID-19 cases reported since the start of the pandemic, this translates to geographically limited sequencing of less than 2% percent of available samples (38). Clearly, there is a need for a more widely accessible and efficient method to detect the emergence of new variants.
The use of a high-throughput, low-cost RT-PCR genotyping panel was shown by previous authors to enable rapid identification of circulating SARS-CoV-2 variants. Neopane et al. (39) demonstrated good concordance between their assay and sequencing for variants circulating between March and July 2021. Out of 150 SARS-CoV-2-positive specimens, 69 (46%) were B.1.617.2, 49 (32.7%) were B.1.1.7, 4 each were P.1 and P.2 (2.7% each), 3 were B.1.526 (2%), and 2 each were B.1.351 and B.1.427 (1.3% each). An additional 17 (11.3%) had a mutation only in D614G. However, 13 of the 14 SNPs used in the panel by Neopane et al. were from the S gene, and many of these mutations are now found in different SARS-CoV-2 lineages (40).
Korukluoglu et al. (41) described a one-step RT-qPCR assay to detect N501Y and HV69-70del using allele-specific forward primers, reserving ORF1ab as an internal control. This was a relatively small study with 165 specimens, and the authors observed 100% concordance with results of Sanger sequencing and NGS. Vogels et al. (42) reported on a RT-qPCR assay to detect ORF1a SGF3675-3677del and spike HV69-70del. This assay was concordant with 76 sequenced specimens. Perchetti et al. (43) utilized a two-step approach combining the CDC-based laboratory-developed RT-qPCR and the Thermo Fisher TaqPath COVID-19 RT-PCR to identify B.1.1.7 variants. However, the Perchetti et al. approach employed labor-intensive droplet digital PCR (ddPCR) and depended on the S gene dropout, which is now known to occur in multiple variants (4448).
Harper et al. (18) developed a genotyping panel to detect variants identified from SARS-CoV-2 sequences surveyed between March and May 2020 and tested this on 50 stored qRT-PCR positive SARS-CoV-2 clinical samples. They initially identified 22 SNPs that could discriminate 15 different genotypes, but subsequent analysis on a larger sequence data set indicated that their approach required 51 markers to maximize sample discrimination. The largest nucleic acid amplification test-based genotyping series thus far was a national effort in France recently reported by Haim-Boukobza et al. (49) The authors used two separate assays to screen for the HVdel69 to 70 and N501Y mutations in 35,208 samples. However, this approach was unable to genotype 19% of the samples.
We sought to expand upon these previous studies by analyzing more than 7.5 million sequences from GISAID to select target SNPs. Our study goals were 3-fold: (i) identify SARS-CoV-2 markers useful for the detection of SARS-CoV-2-positive samples across all variants; (ii) develop the smallest set of SNP markers that can be used to accurately assign lineages to SARS-CoV-2-positive samples (PPA ≥ 95% compared to NGS); and (iii) implement a genotyping approach and platform for the early detection of new and re-emerging variants that signals when markers need to be updated. There may be a bias in our results given that the retrospective and prospective clinical samples used to validate assays originated primarily from Washington, Florida, and California; however, subsequent studies using samples from a broader swath of states showed similar results (data not shown), and national and global knowledge of circulating strains at the time of this study suggest that any bias would be minimal.
This report identifies three variant agnostic markers that can detect SARS-CoV-2-positive samples with high PPA and NPA compared to NGS. These markers are present in almost all the SARS-CoV-2 samples that were sequenced and should be considered in the development of new assays. This study also demonstrates that some marker combinations are highly specific for certain variants. Routine use of these genotyping markers could provide early warning that a new or re-emergent variant is circulating. Importantly, genotyping with this assay is quick and efficient, enabling result reporting in 1 to 2 days, compared to 10 to 14 days with NGS, and for a fraction of the cost. As such, genotyping can be used to monitor a higher percentage of SARS-CoV-2-positive samples than the 5% percent random sampling by sequencing currently practiced in the US. Samples which cannot be assigned to a known variant would be prime candidates for sequencing. Finally, the study demonstrates that the Omicron variant can be identified with high precision with three markers. Incorporating Omicron-specific markers with the markers defined to detect previous variants can provide a framework for the detection of the next new variant.
The genotyping markers for Omicron effectively highlighted the transition from Delta to Omicron as the dominant variant. As illustrated in this report, a static marker set with the Delta-specific markers omitted experienced a significant decline in accuracy within four months of the emergence of Delta in the US. To prevent a loss of marker accuracy, this study demonstrated an approach for detecting new and emerging variants using a classifier algorithm for recurring analysis of active marker sets across regional and global GISAID sequence data. As emerging variants develop, anomalies in classifier calls and the resulting discordance with sequencing classification will continue to highlight the need for marker modifications. With the genotyping assay described herein, addition or subtraction of markers is straightforward. Each SNP marker, including variant agnostic positivity markers, is interrogated in a separate, individual well so that changing one marker has no impact on the performance of other markers in the set. The genotyping assay can be customized to include one or two positivity markers, as well as lineage assignment markers appropriate for the current variant landscape, to simultaneously confirm a SARS-CoV-2-positive sample and identify its genotype. Standardized performance metrics, such as limit of detection (LoD), sensitivity and specificity of custom successor assays, will be easy to establish. A formal LoD was outside the scope of this study, but positivity and genotype calls were obtained for samples with as few as 10 RNA copies per well.
A retrospective review of the emergence of Delta in the US showed that as this variant grew in prevalence, so too did the number of undetermined calls returned by the classifier algorithm. Thus, the classifier algorithm effectively assesses the accuracy of current marker sets based on daily analysis of new viral sequences added to GISAID, creating an adaptive and closed-loop process for low-cost, rapid monitoring of circulating variants and detection of emerging variants. Indeed, the authors recently created a free, live dashboard of a real-time genotyping platform illustrating the symbiotic nature of using genotyping markers in conjunction with targeted sequencing (20). An uptick in SARS-CoV-2-positive samples with undetermined genotyping variant classification will trigger targeted sequencing of this subset of samples to determine whether they represent a novel variant for which genotyping markers should be developed and incorporated. This real-time tracking tool will become increasingly powerful as more public health and private testing labs adopt this genotyping approach and contribute data.


The research reported in this publication was supported by the National Institute of Biomedical Imaging and Bioengineering of the National Institutes of Health (award numbers 75N92022P00030, 75N92021P00116, U54EB015408, and U54EB027690) as part of the Rapid Acceleration of Diagnostics (RADx) initiative, launched to speed innovation in the development, commercialization, and implementation of technologies for COVID-19 testing. The funders had no role in the decision to submit the work for publication, and the views expressed herein are the authors’ and do not necessarily represent the views of the National Institutes of Health or the US Department of Health and Human Services.
We gratefully acknowledge Biocomx contractors Dale Gort, Gail E. Radcliffe, Brian Walsh, and Marianne Weinell; Thermo Fisher Scientific employees Manohar Furtado, Anshu Gupta, Elvis Huarcaya Najarro, Paul Sportmann, and Rui Yang; Helix OpCo employees Marc Laurent, William Lee, and James Lu; and Aegis Sciences Corporation employees Cyndi Clark and Matthew Hardison for their support of this study.
Finally, we thank the originating laboratories responsible for obtaining the specimens, the submitting laboratories where genetic sequence data were generated and shared via the GISAID Initiative, and the GISAID EpiCov Data Curation Team.
E. Lai and R.S. Creager are co-principal investigators and secured funding for this study. E. Lai conceived the experiment and contributed to interpretation of resulting data. R.S. Creager was the lead author and editor of the manuscript. R.S. Creager is the guarantor of this work and, as such, had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. E.B. Kennedy was the secondary author and editor of the manuscript and led the graphic design of tables and figures. K.M.C. O’Donovan was the tertiary author and editor of the manuscript. J. Lozach devised methods for marker selection and contributed to data analysis and interpretation. J. Lozach and J. Davis-Turak devised methods for lineage assignment. J. Lozach led design and J. Davis-Turak, C. Wesselman, and T. Wesselman oversaw development and deployment of the ROSALIND Tracker for COVID-19. C. Wesselman, T. Wesselman, K. Hayashibara, M. Gandhi, and S. Williams provided manuscript writing and editing assistance. K. Hayashibara and P. Brzoska led, and E. Diamond supported, genotyping assay development, general lab protocol development, and data acquisition. Y. Yu also supported genotyping assay development via retrospective bioinformatic analysis. K. Hayashibara, E. Diamond, M. Gandhi, J.M. Nguyen, J.M. Ramirez, and S Williams oversaw implementation of the protocol at partnering labs, and supported data analysis and interpretation. D. Becker, T. Cassens, N.A. Leonetti, and E. Sandoval contributed to Helix OpCo workflow design and implementation. T. Cassens and S. White managed genotyping and sequencing data acquisition and analysis at Helix OpCo. T. Peck and D. Wong co-designed and developed the genotyping assay robotics program. A.L. Greninger led UW Medical Center workflow design and implementation, and A.L. Greninger, P. Hajian, and P. Roychoudhury supported genotyping and sequencing data acquisition and analysis at UW Medical Center.


Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, Niemeyer D, Jones TC, Vollmar P, Rothe C, Hoelscher M, Bleicker T, Brünink S, Schneider J, Ehmann R, Zwirglmaier K, Drosten C, Wendtne C. 2020. Virological assessment of hospitalized patients with COVID-2019. Nature 581:465–469.
Tao K, Tzou PL, Nouhin J, Gupta RK, de Oliveira T, Pond SLK, Fera D, Shafer RW. 2021. The biological and clinical significance of emerging SARS-CoV-2 variants. Nat Rev Genet 22:757–773.
Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, Hengartner N, Giorgi EE, Bhattacharya T, Foley B, Hastie KM, Parker MD, Partridge DG, Evans CM, Freeman TM, de Silva TI, McDanal C, Perez LG, Tang H, Moon-Walker A, Whelan SP, LaBranche CC, Saphire EO, Montefiori DC., Sheffield COVID-19 Genomics Group. 2020. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell 182:812–827.e19.
Hodcroft EB, Domman DB, Snyder DJ, Oguntuyo KY, Van Diest M, Densmore KH, Schwalm KC, Femling J, Carroll JL, Scott RS, Whyte MM, Edwards MW, Hull NC, Kevil CG, Vanchiere JA, Lee B, Dinwiddie DL, Cooper VS, Kamil JP. 2021. Emergence in late 2020 of multiple lineages of SARS-CoV-2 Spike protein variants affecting amino acid position 677. medRxiv.
Kupferschmidt K. 2021. Evolving threat. Science 373:844–849.
Tegally H, Wilkinson E, Lessells RJ, Giandhari J, Pillay S, Msomi N, Mlisana K, Bhiman JN, von Gottberg A, Walaza S, Fonseca V, Allam M, Ismail A, Glass AJ, Engelbrecht S, Van Zyl G, Preiser W, Williamson C, Petruccione F, Sigal A, Gazy I, Hardie D, Hsiao N, Martin D, York D, Goedhals D, San EJ, Giovanetti M, Lourenço J, Alcantara LCJ, de Oliveira T. 2021. Sixteen novel lineages of SARS-CoV-2 in South Africa. Nat Med 27:440–446.
Oude Munnink BB, Sikkema RS, Nieuwenhuijse DF, Molenaar RJ, Munger E, Molenkamp R, van der Spek A, Tolsma P, Rietveld A, Brouwer M, Bouwmeester-Vincken N, Harders F, Hakze-van der Honing R, Wegdam-Blans MCA, Bouwstra RJ, GeurtsvanKessel C, van der Eijk AA, Velkers FC, Smit LAM, Stegeman A, van der Poel WHM, Koopmans MPG. 2021. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science 371:172–177.
Callaway E. 2021. ‘A bloody mess’: confusion reigns over naming of new COVID variants. Nature 589:339.
National Institute for Communicable Diseases (NICD). 2021. Network for Genomic Surveillance in South Africa (NGS-SA). SARS-CoV-2 Sequencing Update. [cited 2022 Feb 8]. Available from https://www.nicd.ac.za/wp-content/uploads/2021/11/Update-of-SA-sequencing-data-from-GISAID-26-Nov_Final.pdf.
Maslo C, Friedland R, Toubkin M, Laubscher A, Akaloo T, Kama B. 2022. Characteristics and outcomes of hospitalized patients in South Africa during the COVID-19 Omicron wave compared with previous waves. JAMA 327:583.
Lewnard JA, Hong VX, Patel MM, Kahn R, Lipsitch M, Tartof SY. 2022. Clinical outcomes among patients infected with Omicron (B.1.1.529) SARS-CoV-2 variant in southern California. medRxiv.
Wolter N, Jassat W, Walaza S, Welch R, Moultrie H, Groome M, Amoako DG, Everatt J, Bhiman JN, Scheepers C, Tebeila N, Chiwandire N, Du Plessis M, Govender N, Ismail A, Glass A, Mlisana K, Stevens W, Treurnicht FK, Makatini Z, Hsiao N, Parboosing R, Wadula J, Hussey H, Davies M, Boulle A, von Gottberg A, Cohen C. 2021. Early assessment of the clinical severity of the SARS-CoV-2 Omicron variant in South Africa. medRxiv.
Washington NL, Gangavarapu K, Zeller M, Bolze A, Cirulli ET, Barrett KMS. 2021. Genomic epidemiology identifies emergence and rapid transmission of SARS-CoV-2 B.1.1.7 in the United States. medRxiv.
Martin DP, Weaver S, Tegally H, San JE, Shank SD, Wilkinson E, Lucaci AG, Giandhari J, Naidoo S, Pillay Y, Singh L, Lessells RJ, NGS-SA, COVID-19 Genomics UK (COG-UK), Gupta RK, Wertheim JO, Nekturenko A, Murrell B, Harkins GW, Lemey P, MacLean OA, Robertson DL, de Oliveira T, Kosakovsky Pond SL. 2021. The emergence and ongoing convergent evolution of the SARS-CoV-2 N501Y lineages. Cell 184:5189.e7–5200.e7.
Scott L, Hsiao N, Moyo S, Singh L, Tegally H, Dor G, Maes P, Pybus OG, Kraemer MUG, Semenova E, Bhatt S, Flaxman S, Faria NR, de Oliveira T. 2021. Track Omicron’s spread with molecular data. Science 374:1454–1455.
Public Health England (PHE) Genomics Cell, PHE Outbreak Surveillance Team. PHE Epidemiology Cell, PHE Contact Tracing Data Team, PHE Health Protection Data Science Team, PHE Joint Modelling Team, NHS Test and Trace Joint Biosecurity Centre, Public Health Scotland and EAVE group, Variant Technical Group Members. 2021. SARS-CoV-2 variants of concern and variants under investigation in England: Technical Briefing 14. [cited 2021 Dec 27]. PHE, London, United Kingdom. Available from https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/991343/Variants_of_Concern_VOC_Technical_Briefing_14.pdf.
Titus K. 2021. Looming unknowns with SARS-CoV-2 variants. CAP Today. [cited 2021 Dec 27]. College of American Pathologists (CAP), Northfield, IL. Available from https://www.captodayonline.com/looming-unknowns-with-sars-cov-2-variants/.
Harper H, Burridge A, Winfield M, Finn A, Davidson A, Matthews D, Hutchings S, Vipond B, Jain N, Edwards K, Barker G, G. 2021. Detecting SARS-CoV-2 variants with SNP genotyping. PLoS One 16:e0243185.
Creager R, Blackwood J, Pribyl T, Bassit L, Rao A, Greenleaf M, Frank F, Lam W, Ortlund E, Schinazi R, Greninger AL, Cirrincione M, Gort D, Kennedy EB, Samuta A, Shaw MK, Walsh B, Lai E. 2021. RADx Variant Task Force program for assessing the impact of variants on SARS-CoV-2 molecular and antigen Tests. IEEE Open J Eng Med Biol 2:286–290.
Rosalind. 2021. ROSALIND Tracker for COVID-19. ROSALIND [cited 2021 Dec 29] Available from https://tracker.rosalind.bio.
GISAID EpiCoV Database. 2021. Freunde von GISAID e.V. Germany. [cited 2021 Dec 28]. Available from https://www.gisaid.org.
Elbe S, Buckland-Merrett G. 2017. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall 1:33–46.
Khare S, Gurry C, Freitas L, Schultz MB, Bach G, Diallo A, Akite N, Ho J, Lee RT, Yeo W, Curation Team GC, Maurer-Stroh S. 2021. GISAID’s role in pandemic response. China CDC Wkly 3:1049–1051.
Harris RS. 2007. Improved pairwise alignment of genomic DNA. Dissertation. Pennsylvania State University, State College, PA. Available from http://www.bx.psu.edu/~rsharris/rsharris_phd_thesis_2007.pdf.
National Center for Biotechnology Information (NCBI). 2020. Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1, complete genome. [cited 2021 Dec 28]. Available from https://www.ncbi.nlm.nih.gov/nuccore/NC_045512.2/.
Obenchain V, Lawrence M, Carey V, Gogarten S, Shannon P, Morgan M. 2014. VariantAnnotation: a Bioconductor package for exploration and annotation of genetic variants. Bioinformatics 30:2076–2078.
R Core Team. 2021. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [cited 2021 Dec 28]. Available from https://www.R-project.org/.
Clinical and Laboratory Standards Institute (CLSI). 2021. EP12-A2: user protocol for evaluation of qualitative test performance. [cited 2021 Dec 31]. Available from https://clsi.org/standards/products/method-evaluation/documents/ep12/.
Davis-Turak J, Courtney SM, Hazard ES, Glen WB, da Silveira WA, Wesselman T, Harbin LP, Wolf BJ, Chung D, Hardiman G. 2017. Genomics pipelines and data integration: challenges and opportunities in the research setting. Expert Rev Mol Diagn 17:225–237.
Association of Public Health Laboratories (APHL). 2021. National SARS-CoV-2 Strain Surveillance (NS3). [cited 2021 Dec 28]. Available from https://www.aphl.org/programs/preparedness/Crisis-Management/COVID-19-Response/Pages/Sequence-Based-Surveillance-Submission.aspx.
Thermo Fisher Scientific. 2021. TaqMan SARS-CoV-2 Mutation Panel user guide. [cited 2021 Dec 30]. Available from https://assets.thermofisher.com/TFS-Assets/LSG/manuals/MAN0024768_TaqManSARS-CoV-2_MutationPanel_UG.pdf.
Thermo Fisher Scientific. 2021. TaqMan SARS-CoV-2 Mutation Panel. [cited 2021 Dec 28]. Available from https://www.thermofisher.com/mutationpanel.
Thermo Fisher Scientific. 2022. TaqMan SARS-CoV-2 Mutation Panel checklist. [cited 2022 Apr 20]. Available from http://assets.thermofisher.com/TFS-Assets/GSD/Reference-Materials/taqman-sars-cov-2-mutation-panel-checklist.pdf.
Bolze A, Cirulli ET, Luo S, White S, Wyman D, Rossi AD, Machado H, Cassens T, Jacobs S, Barrett KM, Tsan KMS, Nguyen K, Ramirez J, Jm III, Sandoval E, Wang X, Wong D, Becker D, Laurent M, Lu JT, Isaksson M, Washington NL, Lee W. 2021. Rapid displacement of SARS-CoV-2 variant B.1.1.7 by B.1.617.2 and P.1 in the United States. medRxiv.
Addetia A, Lin MJ, Peddu V, Roychoudhury P, Jerome KR, Greninger AL. 2020. Sensitive recovery of complete SARS-CoV-2 genomes from clinical samples by use of Swift Biosciences’ SARS-CoV-2 Multiplex Amplicon Sequencing Panel. J Clin Microbiol 59:e02226-20.
Genomics and Diagnostics Team, Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention (CDC). 2021. Research Use Only CDC Influenza SARS-CoV-2 (Flu SC2) Multiplex Assay primers and probes. CDC, Atlanta, GA. [cited 2021 Dec 28]. Available from https://www.cdc.gov/coronavirus/2019-ncov/lab/multiplex-primer-probes.html.
O’Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, Colquhoun R, Ruis C, Abu-Dahab K, Taylor B, Yeats C, Du Plessis L, Maloney D, Medd N, Attwood SW, Aanensen DM, Holmes EC, Pybus OG, Rambaut A. 2021. Assignment of epidemiological lineages in an emerging pandemic using the Pangolin tool. Virus Evol 7:veab064.
Center for Systems Science and Engineering (CSSE), Johns Hopkins University (JHU). 2022. COVID-19 dashboard. Johns Hopkins University School of Medicine, Baltimore, MD. [cited 2022 Feb 8]. Available from https://coronavirus.jhu.edu/map.html.
Neopane P, Nypaver J, Shrestha R, Beqaj SS. 2021. Detection Using TaqMan SARS-CoV-2 Mutation Panel molecular genotyping assays. Infect Drug Resist 14:4471–4479.
Mullen JL, Tsueng G, Latif AA, Alkuzweny M, Cano M, Haag E. 2021. SuLab [cited 2021 Dec 28]. Available from: https://outbreak.info/.
Korukluoglu G, Kolukirik M, Bayrakdar F, Ozgumus GG, Altas AB, Cosgun Y, Kolukirik CZK. 2021. 40 minutes RT-qPCR assay for screening Spike N501Y and HV69-70del mutations. bioRxiv.
Vogels CBF, Breban MI, Ott IM, Alpert T, Petrone ME, Watkins AE, Kalinich CC, Earnest R, Rothman JE, Goes de Jesus J, Morales Claro I, Magalhães Ferreira G, Crispim MAE, Singh L, Tegally H, Anyaneji UJ, Hodcroft EB, Mason CE, Khullar G, Metti J, Dudley JT, MacKay MJ, Nash M, Wang J, Liu C, Hui P, Murphy S, Neal C, Laszlo E, Landry ML, Muyombwe A, Downing R, Razeq J, de Oliveira T, Faria NR, Sabino EC, Neher RA, Fauver JR, Grubaugh ND, Network for Genomic Surveillance in South Africa. 2021. Multiplex qPCR discriminates variants of concern to enhance global surveillance of SARS-CoV-2. PLoS Biol 19:e3001236.
Perchetti GA, Zhu H, Mills MG, Shrestha L, Wagner C, Bakhash SM, Lin MJ, Xie H, Huang M, Mathias P, Bedford T, Jerome KR, Greninger AL, Roychoudhury P. 2021. Specific allelic discrimination of N501Y and other SARS-CoV-2 mutations by ddPCR detects B.1.1.7 lineage in Washington State. J Med Virol 93:5931–5941.
Washington NL, White S, Barrett KMS, Cirulli ET, Bolze A, Lu JT. 2020. S gene dropout patterns in SARS-CoV-2 tests suggest spread of the H69del/V70del mutation in the US. medRxiv.
Boršová K, Paul ED, Kováčová V, Radvánszka M, Hajdu R, Čabanová V, Sláviková M, Ličková M, Lukáčiková L, Belák A, Roussier L, Kostičová M, Líšková A, Maďarová L, Štefkovičová M, Reizigová L, Nováková E, Sabaka P, Koščálová A, Brejová B, Staroňová E, Mišík M, Vinař T, Nosek J, Čekan P, Klempa B. 2021. Surveillance of SARS-CoV-2 lineage B.1.1.7 in Slovakia using a novel, multiplexed RT-qPCR assay. Sci Rep 11:20494.
European Centre for Disease Prevention and Control, WHO Regional Office for Europe. 2021. Methods for the detection and identification of SARS-CoV-2 variants. [cited 2022 Feb 8]. Available from https://apps.who.int/iris/handle/10665/340067.
Wang H, Jean S, Eltringham R, Madison J, Snyder P, Tu H, Jones DM, Leber AL. 2021. Mutation-specific SARS-CoV-2 PCR screen: rapid and accurate detection of variants of concern and the identification of a newly emerging variant with Spike L452R mutation. J Clin Microbiol 59:e00926-21.
Brejová B, Boršová K, Hodorová V, Čabanová V, Reizigová L, Paul ED, Čekan P, Klempa B, Nosek J, Vinař T. 2021. A SARS-CoV-2 mutant from B.1.258 lineage with ΔH69/ΔV70 deletion in the Spike protein circulating in Central Europe in the fall 2020. Virus Genes 57:556–560.
Haim-Boukobza S, Roquebert B, Trombert-Paolantoni S, Lecorche E, Verdurme L, Foulongne V, Selinger C, Michalakis Y, Sofonea MT, Alizon S. 2021. Detecting rapid spread of SARS-CoV-2 variants, France, January 26–February 16, 2021. Emerg Infect Dis 27:1496–1499.

Information & Contributors


Published In

cover image Journal of Clinical Microbiology
Journal of Clinical Microbiology
Volume 60Number 720 July 2022
eLocator: e00342-22
Editor: Yi-Wei Tang, Cepheid
PubMed: 35766514


Received: 2 March 2022
Returned for modification: 28 March 2022
Accepted: 4 May 2022
Published online: 29 June 2022


  1. COVID-19
  2. Delta
  3. genotyping
  4. in vitro diagnostics
  5. mutations
  6. Omicron
  7. RT-PCR
  8. SARS-CoV-2
  9. variants
  10. next-generation sequencing



Eric Lai
Personalized Science, Burlington, Vermont, USA
Emily B. Kennedy
OOMVELT, Lakewood, Ohio, USA
Jean Lozach
ROSALIND, San Diego, California, USA
Kathleen Hayashibara
Thermo Fisher Scientific, South San Francisco, California, USA
Jeremy Davis-Turak
ROSALIND, San Diego, California, USA
David Becker
Helix OpCo, San Diego, California, USA
Pius Brzoska
Thermo Fisher Scientific, South San Francisco, California, USA
Tyler Cassens
Helix OpCo, San Diego, California, USA
Evan Diamond
Thermo Fisher Scientific, South San Francisco, California, USA
Manoj Gandhi
Thermo Fisher Scientific, South San Francisco, California, USA
University of Washington Medical Center, Seattle, Washington, USA
Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Pooneh Hajian
University of Washington Medical Center, Seattle, Washington, USA
Nicole A. Leonetti
Helix OpCo, San Diego, California, USA
Jason M. Nguyen
Helix OpCo, San Diego, California, USA
K. M. Clair O’Donovan
Biocomx, Dana Point, California, USA
Troy Peck
Helix OpCo, San Diego, California, USA
Jimmy M. Ramirez III
Helix OpCo, San Diego, California, USA
University of Washington Medical Center, Seattle, Washington, USA
Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Efren Sandoval
Helix OpCo, San Diego, California, USA
Cassandra Wesselman
ROSALIND, San Diego, California, USA
Timothy Wesselman
ROSALIND, San Diego, California, USA
Simon White
Helix OpCo, San Diego, California, USA
Stephen Williams
Thermo Fisher Scientific, South San Francisco, California, USA
David Wong
Helix OpCo, San Diego, California, USA
Yufei Yu
Biocomx, Dana Point, California, USA
Seegene Technologies, Walnut Creek, California, USA


Yi-Wei Tang


The authors declare no conflict of interest.

Metrics & Citations


Note: There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.

Citation counts come from the Crossref Cited by service.


If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Figures and Media






Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy