INTRODUCTION
A novel variant of coronavirus termed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was promptly identified as the etiologic agent of a pneumonia outbreak in Wuhan, China, in December 2019 (
1). This zoonotic virus spread rapidly among the immunologically naive human populations of the world in only a few months, creating the biggest pandemic crisis since the swine flu in 2009-2010 (
2). SARS-CoV-2 is classified in the beta-coronavirus group upon phylogenetic analysis, sharing high genomic identity (96%) with the bat coronavirus RaTG13 and 80% nucleotide identity with the 2002-2003 pandemic SARS-CoV (
3). Conversely, its genetic relatedness to other human beta-coronaviruses, such as OC-43 and HKU-1, and the human alpha-coronaviruses 229E and NL63 is significantly lower (
4). The laboratory diagnosis of SARS-CoV2 infection is routinely made by the detection of viral RNA in nasopharyngeal samples in symptomatic patients by real-time reverse transcriptase PCR assays (
5). The sensitivity of the molecular assays is good in the acute phase of the disease (
6,
7), but they cease to be useful once the virus is cleared by the immune response of the patient, around 2 weeks following the onset of symptoms (
8). The lack of sensitivity of the molecular assay as a function of time could impair the correct clinical assignment of patients, and this could contribute to the spread of the virus. In addition, some reports indicated that asymptomatic carriers who are not routinely tested could also contribute to spreading the virus (
9).
Serological methods were proposed for use as screening tools for the epidemiological assessment of the immune profiles of the general population (
10). Serological assays could also help to assess the immune status of convalescent-phase sera used to treat severely ill patients or for the validation of the effectiveness of vaccination trials. While serology is an indirect assay, it can provide useful insights about the exposition of asymptomatic individuals by the detection of specific antibodies. Thus, the challenge posed by this novel viral variant is the selection of highly specific antibody detection assays which could discriminate this virus from other related viruses and also be sensitive enough to be useful in the acute phase of the disease. Because of the necessity to screen immune profiles of several thousands of individuals in a short period of time, the validation of several assays available on high-throughput serology analyzers is a prerequisite to accomplish this task. In this study, we sought to evaluate the clinical performance and the virus neutralization activity prediction properties of 10 high-throughput serological enzyme immunoassays (EIA). We also evaluated three lateral flow rapid antibody detection devices, since these tests could be useful for screening purposes in distant settings.
MATERIALS AND METHODS
Study design.
This multicenter evaluation study, initiated by the Québec provincial health authorities, was a collective effort led by the Laboratoire de Santé Publique du Québec (LSPQ). A total of 10 clinical laboratories were selected for this study, including the reference laboratory (LSPQ). These laboratories were grouped in separate testing sites according to their affiliations to university hospital centers, as follows: site 1, Centre Hospitalier de l’Université de Montréal (CHUM) and Centre Hospitalier Universitaire Sainte-Justine; site 2, Centre Hospitalier de l’Université Laval, Hôpital de l’Enfant-Jésus, and Hôpital du Saint-Sacrement; site 3, Hôpital Cité de la Santé; site 4, McGill University Health Center–Glen Site and St. Mary’s Hospital; site 5, Centre Hospitalier de l’Université de Sherbrooke; and site 6, LSPQ.
This study was conducted in accordance with the Declaration of Helsinki in terms of informed consent and approval by an appropriate institutional board and with the regulations concerning ethical reviews in Quebec provincial jurisdiction, particularly those specified in the Civil Code (
http://legisquebec.gouv.qc.ca/fr/ShowDoc/cs/CCQ-1991) and in subsequent internal review board practice. Informed consent was obtained for all participating subjects, and the study was approved by Quebec Public health authorities.
Ten commercial antibody assays running on high-throughput immunoanalyzer platforms currently found in hospital laboratories and three lateral-flow chromatographic immunoassays (LFCIA) were selected for the study and tested at six designated sites. Each of the assays was tested in two separate laboratories to assess the reproducibility of the data except for the LFCIA which were tested only once at site 6. All of the selected assays had received the emergency usage authorization (EUA) by the Food and Drug Administration of the United States, while seven commercial enzyme immunoassays (Abbott IgG, Bio-Rad total antibody, DiaSorin IgG, Euroimmun IgG, Ortho-Clinical total antibody, Roche total antibody, and Siemens total antibody) had been authorized for use in clinical settings by Health Canada.
Patient cohorts.
SARS CoV-2 samples (sensitivity panel;
n = 176) were collected from patients with various clinical symptoms, including 156 sera and 20 plasma samples (
Table 1). All 176 patients were confirmed positive for SARS CoV-2 infection by real-time reverse transcription-PCR (RT-PCR) on nasopharyngeal specimens; 170 of these patients were symptomatic and six were asymptomatic. The average age of the patients was 61 years. Fifty-six (32%) were male, 100 (57%) were female, and 20 (11%) were of unspecified gender. Forty-six samples were obtained at 1 to 7 days, 36 between 8 and 14 days, 60 between 15 and 34 days, and 34 more than 5 weeks after symptom onset. For asymptomatic patients as well as few symptomatic patients without a precise date of symptom manifestations, the time of onset was estimated as 48 h before the first positive SARS CoV-2 RT-PCR result. Among the patients with complete clinical information (
n = 48), 32 (67%) were hospitalized, 29 (60%) required oxygen support, 8 (17%) were admitted to intensive care units, and 42 (88%) reported one or more symptoms (
Table 1). The reported symptoms included one or more of the following: fever, myalgia, anosmia, ageusia, nasal congestion, pharyngitis, anorexia, and diarrhea.
The first 67 SARS-CoV-2 samples collected for the sensitivity panel were selected for virus neutralization activity measurements. Twenty-one samples were obtained at 1 to 7 days, 14 between 8 and 14 days, 26 between 15 and 34 days, and 6 more than 5 weeks after symptom onset.
Specificity and cross-reactivity of the serological assays were evaluated using samples from patients with other laboratory-confirmed acute infections (n = 64): common cold human coronavirus (n = 20), adenovirus (n = 8), enterovirus (n = 2), influenza virus A (n = 3), and B (n = 1), parainfluenza virus 2/3 (n = 2), rhinovirus (n = 6), syncytial respiratory virus (n = 1), dengue virus (n = 20), and varicella-zoster virus (n = 1). In addition, patient samples with autoimmune antibodies (n = 51) and those collected from the general population prior to January 2020 (n = 38) were tested. Samples collected in 2020 from suspected COVID-19 patients with a negative RT-PCR result (n = 13: April, n = 11, and May, n = 2) were also included for specificity assessment. The first confirmed locally acquired COVID-19 cases in Quebec were in March 2020.
A panel to assess test sensitivity was prepared by collecting serum samples from confirmed COVID-19 patients prospectively in sufficient amounts to accommodate all serology platforms at each site. Because of the small volumes of several patient samples dedicated to the specificity panel, we opted to pool two to four samples from patients sharing the same laboratory findings. With the exception of sera obtained from patients with autoimmune diseases, all other samples were prepared by a pooling procedure.
Evaluation of serological assays.
Testing of the commercial immunoassays using various reagent chemistries (enzyme immunosorbent assay [ELISA], chemiluminescence immunoassay [CLIA], electrochemiluminescence immunoassay [ECLIA], and chemiluminescent microparticles immunoassay [CMIA]) and different SARS-CoV-2 antigens (nucleocapsid, spike, and spike domains/epitope) was performed according to the manufacturer’s instructions (
Table 2). Qualitative results were derived based upon the clinically based cutoff values determined by the manufacturers and described in the package inserts. All platforms used their built-in associated programmed protocols, calibrators, and controls for quality control assessment for each assay. All assays were validated to utilize both serum and plasma samples. Five assays were evaluated for the detection of IgG against SARS-CoV-2, while four assays were examined for both IgM and IgG. Finally, one assay was evaluated for the detection of IgA. All sites received the same panels of samples for performance assessment. All assays were performed in two separate sites in order to measure the interlaboratory reproducibility. Qualitative results (positive, negative, and equivocal) were inferred using the manufacturer’s specified cutoff values. Equivocal results were considered negative in the calculation of sensitivity and specificity.
Three lateral-flow rapid antibody detection devices were tested per the manufacturer’s indications at only one site (site 6). All tests were designed to detect both IgM and IgG antibodies in serum or plasma samples. All results were recorded by visual inspection of reactive bands by two technologists. Quality control was assessed by the observation of a colored band on the migration matrices at the control position. A positive result was scored by one technologist when a strong colored band for either IgM and/or IgG appeared at the expected positions, while weak signals needed a consensus between two technologists for a final positive result. Invalid assays with a negative-control band were repeated until a valid result was obtained. We obtained 13 and six invalid tests for BTNX and Biocan, respectively, and none for NADAL.
Virus neutralization activity assessment of SARS-CoV-2 patient sera.
Sixty-seven COVID-19 patients’ samples in our sensitivity panel were assessed for virus neutralization activity using a pseudovirus neutralization protocol described previously (
11–14). Target cells were infected with single-round luciferase-expressing lentiviral particles. Briefly, 293T cells were transfected by the calcium phosphate method with the lentiviral vector pNL4.3.Luc.R-E-(NIH AIDS Reagent Program) and a plasmid encoding SARS-CoV-2 spike at a ratio of 5:4. Two days posttransfection, cell supernatants were harvested and stored at −80°C until use. 293T-ACE2 target cells were seeded at a density of 1 × 10
4 cells/well in 96-well luminometer-compatible tissue culture plates (Perkin Elmer) 24 h before infection. Recombinant viruses in a final volume of 100 μl were incubated with various serum dilutions (1/50, 1/250, 1/1,250, 1/6,250, and 1/31,250) for 1 h at 37°C and were then added to the target cells followed by incubation for 48 h at 37°C; cells were lysed by the addition of 30 μl of passive lysis buffer (Promega) followed by one freeze-thaw cycle. A LB942 TriStar luminometer (Berthold Technologies) was used to measure the luciferase activity of each well after the addition of 100 μl of luciferin buffer (15 mM MgSO
4, 15 mM KPO
4 [pH 7.8], 1 mM ATP, and 1 mM dithiothreitol) and 50 μl of 1 mM
d-luciferin potassium salt (Thermo Fisher Scientific). The neutralization half-maximal inhibitory dilution (ID
50) represents the serum dilution that inhibited 50% of the infection of 293T-ACE2 cells by recombinant viruses.
Calculations and statistical analyses.
Contingency tables were used to compute sensitivity (Se), specificity (Sp), positive predictive values (PPV), and negative predictive values (NPV) as a function of prevalence and total agreement. All statistical analyses were calculated using GraphPad Prism version 8.4.3 (GraphPad, San Diego, CA, USA). The exact binomial proportion 95% confidence intervals (Clopper and Pearson method) were calculated for sensitivity and specificity parameters. Spearman correlation analysis was performed to assess the neutralization prediction capabilities of the commercial assays using data generated by the neutralization and enzyme immunoassays. The determination of optimized cutoff values of the serological assays was accomplished using receiver operating characteristic (ROC) curves and the Youden index. The interrater agreement (Cohen’s kappa) coefficient and total agreement were used to compare the interlaboratory reproducibility of enzyme immunoassays.
DISCUSSION
SARS-CoV-2 serological assays should exhibit excellent sensitivities and specificities to be useful as clinical tools for public health investigations or as a diagnostic aid in particular clinical cases. Health Canada recommends that the Se and Sp of serological assays be higher than or equal to 95% and 98%, respectively, in samples collected 2 weeks or more after the onset of symptoms. Unfortunately, none of the evaluated commercial enzyme immunoassays achieved this level of accuracy using the cutoff values suggested by the companies. Within the rapid-device category, only the NADAL assay was able to comply with the minimal requirements in this time window.
SARS-CoV-2 seroprevalence studies in low-prevalence settings such as those seen in the general population need highly specific assays to rule out potential false-positive reactions resulting from past or present unrelated infections or health conditions. Public health studies of the COVID-19 prevalence in various cohorts from distinct regions in the province of Quebec are ongoing, but preliminary data indicate a seroprevalence of 3% in blood donors 18 to 69 years old in the greater Montreal area (
www.hema-quebec.qc.ca). Seven immunoassays examined in this study are predicted to have excellent specificity (98% to 100%) and good to excellent PPV (82% to 100%) when used in a low (5%)-seroprevalence setting (
Table 5). Accordingly, these assays could potentially be helpful to get a more precise epidemiologic assessment of SARS-CoV-2 exposition following outbreaks in different population settings, such as nursing homes and schools, to name a few.
Similarly to other studies (
18–21), we observe a direct relationship between the sensitivity rates of serological assays and time of sample collection following the onset of symptoms. Long et al. reported a median time for IgM and IgG seroconversion of 13 days after symptom onset (
22). This finding is consistent with the lack of Se observed for most assays in the acute phase. Overall, we observed Se values as low as 74% and as high as 95% at ≥15 days, while this range increases to 79% to 100% at ≥5 weeks after symptom onset. According to a meta-analysis of the accuracy of serological tests, the Se of different types of serological assays was highest when they were used with samples collected at least 3 weeks after symptom onset (
23). The authors reported Se values ranging from 79.7 to 98.9%, 15 days or more following the onset of symptoms.
We also noted that most of our time-stratified Se values were lower than those reported in the company package inserts. For instance, we obtained 85% and 83% sensitivities for the Roche and Abbott assays in the ≥15-day period, while the companies claimed 100% sensitivity for the same time frame. Similar to our findings, other studies reported lower Se values, varying from 73% to 89% and 82% to 92% for the Roche and Abbott assays, respectively (
19–21). Clinical performance characteristic variations between serological studies are not unusual and are highly dependent on the patient cohort used for evaluation. Our patient cohort consisted essentially of symptomatic patients with limited available clinical information. Among the 176 patients, only 48 had complete clinical information, including 32 hospitalized patients and 8 patients at the intensive care unit (ICU). Therefore, it was not possible to stratify our data according to the patient setting (outpatient, ward, ICU). Furthermore, it was demonstrated in several studies that the levels of antibodies in COVID-19 patients are significantly higher in severely ill patients (
24,
25). It is possible that our cohort of patients presented a moderate level of disease, which could translate to a lower level of antibody detection.
Regarding the rapid antibody detection devices, we obtained relatively good Se and Sp values which were lower than those claimed by the manufacturers but similar to those in published studies (
19,
26).
Considering the low Se values reported to date, it is quite clear that serology cannot be used to diagnose acute infections and should rather be used to assess SARS-CoV-2 exposure, as recommended by public health authorities. The lack of Se documented in the acute phase of COVID-19 disease represents a yet-unresolved problem that could be bypassed by several proposed strategies. For example, an orthogonal algorithm using two consecutive enzyme immunoassays with improved global analytical accuracy (
27) was proposed by the Centers for Disease Control and Prevention (
28). Another path toward better clinical performance could be achieved by the development of second-generation immunoassays with recalibrated cutoff values by manufacturers, as proposed in numerous validation studies (
18,
20). Collectively, the determination of optimized cutoff values in our study through ROC curve analyses had a significant impact on the diagnostic resolution of several enzyme immunoassays by increasing the sensitivity significantly without a large trade-off in specificity.
Another medical application for serology is the assessment of the immune status of convalescent-phase plasma collected from recovered SARS-CoV-2 patients. Immune-plasma therapy is considered a therapeutic option to treat very ill patients for whom there is no curative treatment (
29). However, assays designed to measure the virus-neutralizing activities of antibodies are tedious and necessitate biosafety level 3 (BSL3) conditions. Alternatively, pseudovirus neutralization assays are performed in BSL2 research laboratories but cannot be implemented easily in clinical laboratories. The virus neutralization assays typically used the spike protein to assess the presence of neutralizing antibodies (nAb). The spike protein was shown to be crucial for SARS-CoV and SARS-CoV-2 to bind to angiotensin-converting enzyme 2 (ACE2) as the entry receptor on mammalian cells via the receptor binding domain (RBD) epitope of their S1 domain (
30). Intuitively, serological assays which utilize spike protein should be able to detect antibodies having virus neutralization characteristics more efficiently than nucleocapsid-based assays. We titrated 67 samples from COVID-19 patients for the presence of nAb by pseudovirus neutralization assays and performed correlation statistics in order to determine whether the enzyme immunoassays routinely used in clinical settings could predict the presence of nAb in patient samples. Correlation analysis using a paired data set for each sample using the ID
50 values of the neutralization assay and the signal-to-cutoff values of enzyme immunoassays indicated a moderate (nucleocapsid-based assays) to high (spike-based assays) strength of association between nAb and binding antibodies. In fact, the spike-based assays showed the highest coefficient of association (
r; 0.67 to 0.77). Therefore, these results indicate that spike-based immunoassays could be useful as correlates for the detection of nAb in patient samples or alternatively to evaluate the immune status of vaccinated populations. In accordance with our data, the study by Muecksch et al. (
31) showed that commercial spike-based serological assay measurements were better predictors of virus neutralization potency than results of nucleocapsid-based assays. Another study reported that commercially available serological tests have various degrees of accuracy in predicting nAb activity in convalescent-phase plasma (
32). However, the authors of that study did not observe a correlation between antigen type used in serological assays and the predictability of nAb activity levels. Clearly, additional investigations, including functional and structural studies about the native conformation of the whole antigen or subdomains or the glycosylation status of the recombinant spike protein could potentially help to develop enhanced assays.
Furthermore, it will be important to develop assays sensitive enough to detect antibodies in patients enrolled in SARS-CoV-2 vaccination trials. Vaccine development research groups are mostly focusing one spike protein or the RBD, since these proteins/peptides were shown to elicit the production of potent nAb and to confer protection in nonhuman primates challenged with SARS-CoV-2 (
33–35).
An important limitation of this study is related to the possible dilution effect of using a pooling procedure for the preparation of the samples for the specificity panel. Due to potential variation of antibody levels among patients, this could potentially reduce the final concentration of cross-reacting antibodies below the detection threshold for any particular kit and so increase the apparent specificity of that assay. Conversely, cross-reacting antibodies were detected in pooled or nonpooled samples (autoimmune panel) with the Euroimmun assays (IgA and IgG). In this case, other factors, such as antigen preparation (peptide conformation and glycosylation status), may contribute to the high false-positive rates observed with these assays.
In this study, we evaluated 10 enzyme immunoassays and three lateral-flow chromatography assays. The specificity was excellent for the majority of the assays, as they were probably designed to be very specific by the manufacturers at the expense of sensitivity. However, several scenarios exist to increase their clinical performance. Orthogonal algorithm testing and optimized enzyme immunoassays should be validated promptly and implemented in clinical settings. As more data accumulate, spike-based immunoassays seem to be better correlates of neutralization activity and could potentially be used to characterize plasma samples from recovered patients in routine clinical laboratories. In the face of the putative second wave of COVID, it will be important to investigate the immunity of different populations. With this knowledge at hand, it will be possible in the long term to examine the protection levels conferred by nAb in immunized individuals from natural infection or vaccination as the pandemic continues to spread.