ABSTRACT

Deployment of clinical testing on a massive scale was an essential control measure for curtailing the burden of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections and the magnitude of the COVID-19 (coronavirus disease 2019) pandemic during its waves. As the pandemic progressed, new preventive and surveillance mechanisms emerged. Implementation of vaccine programs, wastewater (WW) surveillance, and at-home COVID-19 antigen tests reduced the demand for mass SARS-CoV-2 testing. Unfortunately, reductions in testing and test reporting rates also reduced the availability of public health data to support decision-making. This paper proposes a sequential Bayesian approach to estimate the COVID-19 test positivity rate (TPR) using SARS-CoV-2 RNA concentrations measured in WW through an adaptive scheme incorporating changes in virus dynamics. The proposed modeling framework was applied to WW surveillance data from two WW treatment plants in California; the City of Davis and the University of California, Davis campus. TPR estimates are used to compute thresholds for WW data using the Centers for Disease Control and Prevention thresholds for low (<5% TPR), moderate (5%–8% TPR), substantial (8%–10% TPR), and high (>10% TPR) transmission. The effective reproductive number estimates are calculated using TPR estimates from the WW data. This approach provides insights into the dynamics of the virus evolution and an analytical framework that combines different data sources to continue monitoring COVID-19 trends. These results can provide public health guidance to reduce the burden of future outbreaks as new variants continue to emerge.

IMPORTANCE

We propose a statistical model to correlate WW with TPR to monitor COVID-19 trends and to help overcome the limitations of relying only on clinical case detection. We pose an adaptive scheme to model the nonautonomous nature of the prolonged COVID-19 pandemic. The TPR is modeled through a Bayesian sequential approach with a beta regression model using SARS-CoV-2 RNA concentrations measured in WW as a covariable. The resulting model allows us to compute TPR based on WW measurements and incorporates changes in viral transmission dynamics through an adaptive scheme.

INTRODUCTION

Effectively monitoring the evolution of the COVID-19 (coronavirus disease 2019) pandemic and controlling the spread of disease remains a major public health challenge. Statistical and mathematical models are important components of effective monitoring systems to track COVID-19 cases, hospitalizations, and deaths (1, 2). Unfortunately, the rapid evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and variable community responses to public health interventions complicate the development of robust mathematical models. Classic models in epidemiology are limited in their ability to capture the complex dynamics of the evolving pandemic (3, 4). Data availability and quality have also changed over time (5, 6). Deployment of clinical testing on a massive scale (“mass testing”) was an essential control measure for curtailing the burden of COVID-19, particularly during its early phases. As the pandemic progressed, new interventions and monitoring strategies surged, including vaccine programs, wastewater (WW) surveillance (7, 8), and at-home COVID-19 antigen tests (9). In this study, we recognize the increasingly important role of WW surveillance data in disease monitoring and use WW data and statistical modeling to infer public health metrics.
Mass testing, contact tracing, isolation, and mobility restrictions made it possible to estimate the burden of disease during the early phases of the pandemic (10, 11). The return to “normal” was accompanied by a decrease in COVID-19 clinical testing programs and an increase in at-home diagnostic tests. These changes diminished the utility of individual case data for public health decision-making. Using the number of confirmed cases to determine the prevalence of disease in a community introduces bias since case counts depend on the volume of tests conducted, testing priorities, and the timing of case detection (12, 13). Test positivity rate (TPR) has been shown to be a better indicator of disease spread than confirmed cases because it considers both tests conducted and cases detected (14, 15).
During the COVID-19 pandemic, public health officials commonly used the TPR to infer the adequacy of population-level testing and the extent of SARS-CoV-2 transmission in a population (15 - 17). A low TPR indicates a low level of virus transmission and reflects a high surveillance capacity and rapid case identification. In contrast, a higher TPR indicates a higher level of virus transmission but also suggests that too few tests are being conducted and many infected individuals are likely undetected (18 - 20).
At the beginning of the pandemic, the World Health Organization (WHO) recommended a TPR threshold of 5% to declare COVID-19 transmission under control (21). As the number of people being tested for COVID-19 declined over time, the TPR alone was deemed insufficient to assess community-level transmission. Limited levels of testing meant that public health authorities focused on passive case-finding (i.e., only those considered most likely to be infected due to symptoms or contacts were tested). As a result, TPR tended to be artificially high, and models using TPR as input tended to overestimate the proportion of people infected. This is contrary to models based only on observed cases, which may underestimate COVID-19 prevalence (22). Despite these limitations, TPR can still provide a reasonable estimate of the extent of an outbreak if TPR is combined with additional information.
Public health authorities are turning to wastewater-based epidemiology (WBE) as an alternative strategy for less-biased population-level surveillance of SARS-CoV-2 RNA. WBE uses biomarkers in WW to monitor trends in community-level health indices. WBE methods have been used to detect changes in drug consumption (23), dietary patterns (24), and the circulation of pathogens like poliovirus and norovirus (25). SARS-CoV-2 RNA measurements in WW can be used to understand COVID-19 epidemiology because infected individuals shed the virus into the sewer system throughout their infection (8). During the COVID-19 pandemic, SARS-CoV-2 RNA concentrations in WW were shown to correlate strongly with confirmed cases in numerous studies (e.g., 8, 22, 26). However, some studies have shown that the relationship between WW and COVID-19 clinical cases varies over time (27, 28). This relationship is affected by many factors, including testing availability and practices, public health policies, social behaviors, vaccine uptake, acquired immunity, and the emergence of new variants that may impact fecal shedding (27, 29, 30). Xiao et al. (27) attributed changes in the ratio of WW signal to daily new positive clinical tests to insufficient testing in the general population (i.e., inadequate testing to capture the exponential growth of actual COVID-19 cases).
In this paper, we correlate WW with TPR to monitor COVID-19 trends using high-quality clinical testing and WW data from Davis, California. We pose an adaptive scheme to model the nonautonomous nature of the prolonged COVID-19 pandemic. The TPR is modeled through a sequential Bayesian approach (3) with a beta regression using the WW viral loads as a covariable. The resulting model allows us to compute a TPR based on WW measurements and incorporates changes in viral transmission dynamics through an adaptive scheme. The TPR estimate is used to calculate values for WW data that correspond to TPR thresholds using criteria proposed by the U.S. Centers for Disease Control and Prevention (CDC) in 2021 (31). The TPR thresholds indicated low transmission for a TPR <0.05, moderate transmission for a TPR within a range of 0.05 and 0.08, substantial transmission for a TPR within a range of 0.08 and 0.1, and high transmission for a TPR ≥0.1. Due to uncertainties in relating WW data to COVID-19 case counts, public health authorities typically evaluate trends in WW data to assess changes in infection rates rather than absolute thresholds. It may also be helpful for public health authorities to interpret WW data in terms of TPR. Our modeling approach provides insights into the evolution of virus transmission dynamics and a methodology that combines different sources of information to continue monitoring COVID-19 trends.

RESULTS

The analytical framework was developed using data from the City of Davis (Davis) and replicated for the University of California, Davis campus (UC Davis). To display changes in the relation between TPR and WW signal across variants, we estimate the TPR using the proposed sequential model and compute the posterior distribution for the parameter β1 over time (Fig. 1 and 2, top-panel). The results show how the relation of TPR and WW signal changed over time, mainly as new variants emerged. Note that while these were new periods of a variant emerging, this also likely corresponded to a period of under-testing when variants began to surge.
Fig 1
Fig 1 Posterior distribution of β1 (A) and estimated TPR for the City of Davis (B). Red-solid line and blue-shadow area describe the median and 95% prediction intervals, respectively.
Fig 2
Fig 2 Posterior distribution of β1 (A) and estimated TPR for UC Davis (B). Red-solid line and blue-shadow area describe the median and 95% prediction intervals, respectively.
The Algorithm 1 was used to compute the thresholds for WW data with the estimated TPR in each time window. We did not calculate thresholds for the period of September through December 2021 since the corresponding posterior distribution for β1 contained zero. In other words, the probability that the β1 parameter will be zero was positive, suggesting that there is no significant association between WW data and TPR (Fig. 1 and 2). We illustrate the estimated thresholds for WW corresponding to low, moderate, substantial, and high transmission thresholds proposed by the CDC (Fig. 3). High variability in the threshold estimation coincides with the emergence of new variants. This variability is reduced in the period where the variant is dominant. WW thresholds were determined by calculating the mean of the estimated thresholds in each period; see Table 1. Thresholds are reported as N/PMMoV (N gene copies per gram dry weight solids normalized by mild pepper mottle virus gene copies per gram dry weight solids).
Fig 3
Fig 3 Wastewater thresholds over time for (A) the City of Davis and (B) UC Davis. Blue, yellow, and red horizontal lines correspond to low, moderate, and high transmission thresholds, respectively. Raw (black triangles) and smoothed N/PMMoV WW data (gray dots).
TABLE 1
TABLE 1 WW thresholds were determined by calculating the mean of the estimated thresholds in each period
 Wastewater thresholds
DeltaOmicronBA.2–5
Davis   
Low< 5.72 × 10−4< 2.61 × 10−4< 4.82 × 10−4
Moderate(5.72 × 10−4, 7.76 × 10−4)(2.61 × 10−4, 3.65 × 10−4)(4.82 × 10−4, 6.58 × 10−4)
Substantial(7.76 × 10−4, 8.63 × 10−4)(3.65 × 10−4, 4.10 × 10−4)(6.58 × 10−4, 7.44 × 10−4)
High> 8.63 × 10−4> 4.10 × 10−4> 7.44 × 10−4
UC Davis   
Low< 5.16 × 10−4< 3.93 × 10−4< 3.87 × 10−4
Moderate(5.16 × 10−4, 6.53 × 10−4)(3.93 × 10−4, 5.28 × 10−4)(3.87 × 10−4, 5.36 × 10−4)
Substantial(6.53 × 10−4, 7.17 × 10−4)(5.28 × 10−4, 5.89 × 10−4)(5.36 × 10−4, 6.03 × 10−4)
High> 7.17 × 10−4> 5.89 × 10−4> 6.03 × 10−4
Fig. 4 illustrates the effective reproductive number ( Re ), which was computed assuming the median of the predicted TPR multiplied by the average number of tests, N , performed per day for Davis ( N=1,198 ) and UC Davis ( N=2,381 ), in the study period. We also compare Re computed with observed cases. Re trends determined from WW were similar in magnitude and depicted similar trends for Re calculated using observed cases during the periods analyzed.
Fig 4
Fig 4 Effective Re of (A) the City of Davis and (B) UC Davis computed with observed cases (black line and gray shadow) and with the median of the predicted TPR multiplied by the average number of tests performed in the study period (blue lines and blue shadow). Solid lines and shaded regions illustrate the median and 95% prediction intervals, respectively.

DISCUSSION

Substantial changes in test availability and test-seeking behavior may confound COVID-19 case count estimates in a community (22, 32). The TPR may provide a more accurate reflection of the state of the epidemic (15). TPR is an important metric because it indicates how widespread an outbreak is within a particular area where testing is being conducted and whether current levels of testing are sufficient to accurately capture levels of disease transmission (16, 18). Increments in TPR can indicate that it may be a good time to incorporate restrictions to slow the spread of disease (18).
This study proposes a sequential Bayesian framework to model the COVID-19 TPR via WW data for near real-time monitoring of the COVID-19 pandemic. The TPR is modeled as a reparametrized beta regression, and the parameters are estimated using a Bayesian approach. The changing dynamics of the virus and data availability impose challenges in the development of actionable mathematical models for the surveillance and monitoring of COVID-19 (3, 33). Here, we propose an adaptive modeling framework as an alternative to overcome some of the limitations of traditional models. The adaptive capacity of the model is well suited to capture the variability in virus trends over time by leveraging knowledge gained.
We then use the model developed to offer a retrospective estimate of WW thresholds (for the settled solids analytical method performed) that corresponded to TPR thresholds recommended by the CDC, for community transmission levels of SARS-CoV-2. The WW thresholds determined for UC Davis appeared more stable through time and through waves of different SARS-CoV-2 variants than WW thresholds estimated for the City of Davis. This can be explained by the fact that the UC Davis TPR was nearly always less than 5% while the City of Davis TPR exceeded 5% periodically over the study period. Confidence in disease dynamics breaks down as TPR rises above 5% (21). The WW thresholds estimated for the City of Davis were similar to those for UC Davis when TPR remained low but increased dramatically at the end of the study period when clinical testing rates declined in Davis, usage of at-home test kits increased, and TPR surpassed 5%. Mandatory asymptomatic testing continued for UC Davis through the end of the study period. In the absence of strong clinical testing programs, the relatively more stable WW thresholds. determined over this study period may serve as a future reference to assess relative SARS-CoV-2 infection dynamics for these sewersheds. We caution against the direct translation of the estimated WW thresholds to other sewersheds and analytical methods for WW. Further research is needed to investigate the application of this framework to other sewersheds, for alternative WW analytical methods, and other respiratory and enteric pathogens present in WW.
There are some limitations that are worth considering in our modeling framework. One of the limitations of this framework is that it requires access to both WW and test data for continued adaptation, which implies continuous community monitoring through testing. The capacity of current testing programs has decreased significantly with recent transitions to a new normal and the implementation of prevention and surveillance mechanisms such as vaccines, at-home tests, and WW surveillance. New limited testing may lead to passive case-finding (i.e., only those most likely to be infected are tested). The TPR may thus overestimate the current burden of the disease under these conditions. It is important to highlight that reduction in information poses new challenges and limitations in COVID-19 monitoring. With reductions in testing, public health authorities must decide between an indicator that overestimates the burden of the disease (TPR) and a projection that underestimates the burden of the disease (case counts). We propose to use the TPR in combination with WW measurements to reduce bias in the estimation of the burden of disease. To bypass the inherent limitations of TPR in estimating disease spread, one could consider hospitalizations. However, publicly available hospitalization is only available at the county level, and only one or maybe a few cities may have monitoring for WW, which might not represent the whole county.
In our study, the relationship between WW concentrations and TPR changed when TPR increased. This was likely due to changes in test-seeking behavior and test availability. Viral shedding may also change through time due to changes in vaccination status, acquired immunity, and changes in transmission patterns for different viral variants—although there is limited evidence available from fecal shedding studies. WW thresholds were estimated herein based on TPR for a community where testing rates were extraordinarily high, given the population size. While it is not feasible at this time to establish a priori public health thresholds based on WW concentrations alone to estimate the burden of the disease through time, a record of historical values and thresholds provides meaningful context to guide public health authorities as new waves of infection arise.

MATERIALS AND METHODS

The City of Davis and UC Davis Campus WW collection areas (commonly referred to as sewersheds) are geographically adjacent. The analysis includes laboratory-confirmed incident COVID-19 cases and WW data from 1 July 2021, to 1 July 2022, for Davis and UC Davis. This period of study captured three waves of the pandemic, namely the Delta variant (dominant from 1 July 2021 to 14 December 2022), the Omicron variant BA.1 (dominant from 15 December 2021, to 15 March 2022), and Omicron variants BA.2, BA.3, BA.4, and BA.5 (dominant from 16 March 2022, to 1 July 2022); hereafter Omicron variants will be denoted as one BA.2–5 wave. These periods were defined according to the dominance of a variant as reported by the California Department of Public Health (34).
Daily COVID-19 cases and tests for Davis were provided by Healthy Davis Together (HDT) (35). HDT is the community pandemic response program launched in Davis from September 2020 to 30 June 2022, to mitigate the spread of COVID-19. HDT involved a broad set of interventions, including free saliva-based asymptomatic testing with high throughput methods to process large volumes of tests. Testing and cases for UC Davis come from the campus community COVID-19 screening program, which includes mandatory completion of biweekly asymptomatic tests to access campus facilities. The study site is thus unique compared with most WW surveillance regions in that there was an extraordinarily high number of clinical tests performed in both sewersheds during the study period (Fig. 5) for relatively small population size (425,314 tests were performed over the study period by Davis and 835,785 for UC Davis for approximately 66,799 residents in the combined surveillance regions). The high number of tests resulted in TPR ≤0.05 for the majority of the study period in both locations (Fig. 6). These conditions provide a useful context for estimating WW thresholds corresponding to the CDC-defined TPR thresholds for community transmission levels of SARS-CoV-2 (31). The WHO contends that disease dynamics based on case data can be confidently tracked when TPR ≤0.05 (21). Otherwise, when TPR ≥0.05, WW measurements offer a more robust measure of true disease dynamics (19).
Fig 5
Fig 5 Normalized wastewater data (N/PMMoV), no. of COVID-19 tests conducted (Tests), and positive cases (Cases) for (A) Davis and (B) UC Davis.
Fig 6
Fig 6 Seven days trimmed mean of wastewater data (Smoothed N/PMMoV) and daily TPR for (A) Davis and (B) UC Davis (1 July 2021–1 July 2022).

Wastewater settled solids methods

WW settled solids for the Davis wastewater treatment plant (WWTP) were collected daily from the primary clarifier, transported on the same day of collection to the analytical laboratory, and processed within 24 h as previously described (36). WW settled solids were obtained from daily composite influent samples from UC Davis WWTP. Composite influent samples were collected using a refrigerated autosampler (Hach Sigma 900 MAX) located at the WWTP headworks and programmed to collect flow-weighted influent sample volumes every 20 min for a total volume of 19 L in 24 h. Composite influent samples were then transferred to one or two 4 L low-density polyethylene containers (LDPE Cubitainers, Thermo Scientific I-Chem) and stored at 4°C prior to settling (up to 6 d of storage). Each 4 L sample was pasteurized in a 60°C water bath for 45 min immediately prior to settling. Pasteurized influent samples were inverted to mix, poured into a 3-gallon high-density polyethylene conical vessel equipped with a sampling port (FF3G, FastFerment), and left to settle for 2 h. Settled solids were obtained from either 4 L or 8 L of influent from a single day (two 4 L samples were combined into one settling vessel when 8 L was used). Settled solids were collected by dispensing from the bottom of the settling vessel into one or two 50 mL polypropylene centrifuge tubes (for 4 L or 8 L initial volume, respectively). If two tubes of settled solids were obtained, the supernatants were carefully decanted from each, and the remaining settled solids were combined. Between sampling episodes, the settling tanks were emptied, and the tank and sampling port valves were bleached (10% commercial bleach for 1 h), rinsed with deionized water, and left to air dry. Samples of settled solids were stored at 4°C and subsequently transported on ice in a cooler by a courier to the laboratory. Sample processing was completed within 8 d of initial sample collection. Sample RNA extraction, purification, and droplet digital reverse transcriptase PCR (ddRT-PCR) followed the same protocol as for the Davis samples. These protocols are described in detail elsewhere (37, 38).
We normalize the SARS-CoV-2 RNA concentration determined (N gene copies per gram dry weight solids) by the concentration of mild pepper mottle virus (PMMoV gene copies per gram dry weight solids) to yield the dimensionless metric, N/PMMoV. The N gene is present in all variants of the virus. PMMoV is a highly abundant RNA virus detected broadly in WW (39, 40). PMMoV serves as a process control such that normalization of N by PMMoV for each sample helps correct SARS-CoV-2 concentrations for virus extraction efficiency. PMMoV is also often used to account for variations in population size, rainfall, and water usage between different WW collection areas (41). We expect these latter factors to have less of an effect on N gene concentrations determined herein because water is removed from the WW settled solids samples prior to sample analysis and concentrations are reported in terms of the dry weight of the dewatered solids. Fig. 5 shows the normalized N gene concentration.
Given that WW signals are often noise corrupted, we applied a 7-d trimmed mean for daily WW data (smoothed N/PMMoV) to reduce uncertainty and minimize daily fluctuations. The smoothed data are later correlated to raw TPR (Fig. 6).
Table 2 illustrates Pearson’s correlation coefficient between the WW data (smoothed N gene and smoothed N/PMMoV) and clinical data (cases and TPR) in different waves. An improvement in correlation is observed when using TPR, compared with other cases. In general, an increase in correlation is also observed when using the normalized signal (N/PMMoV). Additionally, Table 2 highlights a notable change (from Omicron to BA.2–5 wave) in the correlation between WW and case data, while the correlation between WW data and TPR showed a relatively small change. These observations suggest that one of the primary factors responsible for the shift in the WW-to-cases relationship could be the reduction in the number of tests conducted during the second wave (see Fig. 5).
TABLE 2
TABLE 2 Correlations between WW data (N gene or N/PMMoV) and daily TPR or COVID-19 positive cases (Cases) for Davis and UC Davis by wave
 Test positivity rateCases
AllDeltaOmicronBA.2–5AllDeltaOmicronBA.2–5
Davis        
N gene0.680.320.700.650.360.420.670.55
N/PMMoV0.710.350.760.700.390.440.730.55
UC Davis        
N gene0.760.220.740.710.640.010.650.51
N/PMMoV0.770.350.790.710.620.030.700.47

Statistical model

We model TPR as a beta distribution using WW viral loads as a covariate, assuming a Bayesian approach. We assume Yi as the TPR at day i , defined as the ratio of the number of new positive cases among the number of tests performed at day i . Beta regression is a good choice of model for continuous data with response variables expressed as proportions.
The beta distribution is reparametrized as Yi(μi,ϕ) , with mean μi and variance σi2=μi(1-μi)/(1+ϕ) , ϕ is known as the precision parameter since, for fixed μi , the larger the ϕ value, the smaller the variance of Yi;ϕ1 as a dispersion parameter (42). The mean μi can be expressed as a function of the linear predictor ηi=βTxi , where β=(β0,β1,,βp)T is a (p+1) -dimensional vector of unknown regression coefficients (including the intercept), and xi=(1,x1,,xp)T is the vector of covariates plus a one for the intercept. In this study, only the 7 d trimmed mean of the WW data (smoothed N/PMMoV), denoted as Ci , is included in the linear predictor ( p=1 ). Thus, the linear predictor is given by ηi=β0+β1Ci , and the logit link, the inverse of the logistic function, is used in the beta regression (i.e.,  logit(μi)=log(μi1μi)=ηi ).
The aim of this inference problem is to estimate θ=(β0,β1) from measurements of WW data, C=(C1,C2,,Cn) , and COVID-19 TPR, Y=(y1,y2,,yn) . Thus, the likelihood function for the previous model is given by:
L(θ|C,Y)=i=1n1B(ai,bi)yiai1(1yi)bi1,
where ai=μiϕ , bi=ϕ-ai , μi=1/(1+exp(-ηi) is the mean (logistic function), ηi=β0+β1Ci is the linear predictor, and ϕ is the dispersion parameter.

Bayesian statistical approach

We adopt a Bayesian statistical approach, which is well suited to model multiple sources of uncertainty and allows for incorporating background knowledge of the model’s parameters. In this framework, a prior distribution, πΘ(θ) , is required to account for the unknown parameter θ in order to obtain the posterior distribution. Having specified the likelihood and the prior, we use Bayes’ rule to calculate the posterior distribution,
πΘ|C,Y(θ|C,Y)=πΘ(θ)L(θ|C,Y)Z(Y),
where Z(Y)=πΘ(θ)L(θ|C,Y)dθ is the normalization constant. The posterior distribution is simulated using an existing Markov chain Monte Carlo (MCMC) method, the t-walk algorithm (43).

Bayesian sequential method

We adapt the sequential approach proposed in reference 3 to our model to update forecasts over time. The aim is to train the model using only a subset of the most recent data. The forecast is then updated sequentially in a sliding window of data.
We let L be the length in days of the period used to train the model. The data window is then moved forward every n day as new data become available. We set t 0 as the first initial time to start the analysis and the subsequent initial times as tk+1=tk+n . The training period is taken as [tk,tk+L] and the forecasting period as [tk+L,tk+L+F] , see Fig. 7.
Fig 7
Fig 7 Schematic illustrating sequential modeling approach. The model is fitted with WW and TPR data from the training period (dark gray, denoted by L) starting at time t k . Then, the estimated parameters are used to predict the forecasting period (green, denoted by F). The training window is then moved n days forward. When new data become available, we update all forecasts; the latest posterior becomes the newest prior to the next training period.
We denote θk=(β0(k),β1(k)) the model parameters to be inferred and the vectors of data as Ck,n=(Ck,,Ck+n),Yk,n=(yk,,yk+n) at period k . Note that, from the beginning, θk is assumed to change in time within each forecast window. If k=0 , we postulate a prior distribution πΘk(θk) and a likelihood L(θk|xk,n,Yk,n) previously described. The probabilistic prediction of y t , in the forecasting period t[tk+L,tk+L+F] , is obtained by using the estimated parameters through the MCMC method and the WW data concentration ( Ck+L,F ).
Afterward, the forecasting window is updated by setting tk+1=tk+n , with n as the number of days until the next forecast. In the new training window [tk+1,tk+1+L] , we propose a new prior distribution πΘk+1(θk+1) for the model parameters θk+1 using samples from the posterior distribution obtained in the previous forecast. Finally, we set k=k+1 and repeat the process described above to create a new forecast, see reference (3) for implementation details.
Regarding the elicitation of the parameters’ prior distribution for the first forecast, at k=0 , we assume a normal distribution for the parameters β0(k) , β1(k) , with mean and standard deviation (0,0) and (1,000, 1,000), respectively (i.e., noninformative priors). We set L to twice the length from symptoms onset to mild disease clinical outcome, namely 30 d, and F is chosen to be 10 d. The forecasting was updated every n=10 d.

Action thresholds for WW concentration

We let Y be the TPR, and C corresponds to the SARS-CoV-2 RNA concentrations measured in WW. Then, the cumulative distribution function of Y given C is defined as FY|C(y|c):=P(Yy|C=c), which represents the probability that TPR is less than or equal to y , given that the SARS-CoV-2 RNA concentration in WW is c . Henceforth for simplicity, we will use F(y|c) instead of FY|C(y|c) without loss of generality. The quantile function F-1 of Y given C=c is defined by
Fc1(p):=inf{yR:F(y|c)p},p(0,1).
The p -quantile of a data set is defined as the value where a p fraction of the data is below that value and (1-p) fraction of the data is above that value (e.g., the 0.5 quantile is the median).
Using the CDC thresholds for TPR values corresponding to low ( Y0.05 ), moderate ( Y(0.05,0.08) ), substantial ( Y(0.08,0.1) ), and high ( Y0.1 ) transmission levels of SARS-CoV-2 and the parameter estimates from the assumed beta regression model, we propose a methodology for estimating WW concentrations associated with TPR thresholds at a given point in time. We find the value of WW concentrations, c , such that with probability 1-α , the TPR is less than the CDC threshold y{0.5,0.8,0.1} (i.e., to find c such that Fc-1(α)=y ). Note that α is the precision we set to estimate the threshold. With α=0.05 , we are being conservative and choose lower bounds of WW concentration values associated with TPR thresholds.
The proposed method for finding these thresholds, assuming that we have simulations of the posterior distribution for the parameter θ=(β0,β1) , is given in Algorithm 1. First, we suggest a search grid for the WW viral load concentration. Then, for each concentration, we simulate the predicted posterior distribution of the TPR. Lastly, we calculate the α -quantile for each concentration level and find the concentration level that yields the quantile closest to the value of the desired TPR threshold.
Input: A sample, θ1,θ2,,θN , from the posterior distribution of Θ , where       θi=(β0,i,β1,i), i=1,,N . The precision parameter ϕ , the threshold for the       TPR, Tpr , and the probability α ;
Output: Threshold for WW concentration Tww ;
Step 1. Generate a grid of WW concentration, c1,c2,,cL ;
for l  in 1 to L do
  for i  in 1 to N do
   Step 2. Simulate Yl,iBeta(μl,i,ϕ) , where μl,i=11+exp(-ηl,i) and      ηl,i=β0,i+β1,icl ;
  Step 3. Compute the α -quantile of Yl=(Yl,1,,Yl,N) , namely Ql ;
Step 4. Find l* such that l*=argminx|Ql-Tpr| ;
Step 5. Set Tww=cl*

Effective reproductive number

The number of people in a population who are susceptible to infection by an infected individual at any particular time is denoted by Re , the effective reproductive number. This dimensionless quantity is sensitive to time-dependent variation due to reductions in susceptible individuals, changes in population immunity, and other factors. Re can be estimated by the ratio of the number of new infections ( It ) generated at time t , to the total infectious individuals at time t , given by s=1tIt-sws , the sum of new infections up to time step t-1 , weighted by the infectivity function w s . We implement the method proposed in reference 44 to calculate the Re from the TPR estimated with the WW data.
Note that the TPR is an estimation of the proportion of infected persons. Therefore, if we multiply the TPR by a T value, representing the total number of tests carried out in the study period, we will have an estimate of the incidence, which we can use to compute the Re . Since Re is a scale-free metric, we should get similar results for different T values. We use T as the average of the tests carried out in Davis or UC Davis.

ACKNOWLEDGMENTS

This research was supported by the National Center for Advancing Translational Sciences, National Institutes of Health, through grant number UL1 TR001860. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. This research was also supported by Healthy Davis Together (HDT) programs at the University of California, Davis. The Centers for Disease Control and Prevention Foundation provided funding to Alexandria B. Boehm for wastewater analysis conducted by the Sewer Coronavirus Alert Network (SCAN).
We thank Colleen C. Naughton, Alexandria B. Boehm, and Marlene K. Wolfe for their valuable comments.
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Conceptualization, Methodology, Software, Formal analysis, Writing - Original draft preparation, Writing - Reviewing & Editing, J.C.M.-L. and M.L.D.-T.; Literature review, Writing - Reviewing & Editing, Y.E.G. and C.H.; Laboratory sample processing, Writing - methods, C.W.B.; Project conception, funding, research oversight, collaborator coordination, Writing - methods, Writing - Reviewing & Editing, H.N.B. Methodology, research oversight/supervision, Writing - Reviewing & Editing, M.N. All authors contributed to the article and approved the submitted version.

REFERENCES

1.
Capistran MA, Capella A, Christen JA. 2021. Forecasting hospital demand in metropolitan areas during the current COVID-19 pandemic and estimates of lockdown-induced 2nd waves. PLoS One 16:1–16.
2.
Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, Colaneri M. 2020. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat Med 26:855–860.
3.
Daza-Torres ML, Capistrán MA, Capella A, Christen JA. 2022. Bayesian sequential data assimilation for COVID-19 forecasting. Epidemics 39:100564.
4.
Engbert R, Rabe MM, Kliegl R, Reich S. 2020. Sequential data assimilation of the stochastic SEIR epidemic model for regional COVID-19 dynamics. Bull Math Biol 83:1–16.
5.
Jewell NP, Lewnard JA, Jewell BL. 2020. Predictive mathematical models of the COVID-19 pandemic: underlying principles and value of projections. JAMA 323:1893–1894.
6.
Shinde GR, Kalamkar AB, Mahalle PN, Dey N, Chaki J, Hassanien AE. 2020. Forecasting models for coronavirus disease (COVID-19): a survey of the state-of-the-art. SN Comput Sci 1:197.
7.
Ahmed W, Angel N, Edson J, Bibby K, Bivins A, O’Brien JW, Choi PM, Kitajima M, Simpson SL, Li J, Tscharke B, Verhagen R, Smith WJM, Zaugg J, Dierens L, Hugenholtz P, Thomas KV, Mueller JF. 2020. First confirmed detection of SARS-CoV-2 in untreated wastewater in Australia: a proof of concept for the wastewater surveillance of COVID-19 in the community. Sci Total Environ 728:138764.
8.
Huisman JS, Scire J, Caduff L, Fernandez-Cassi X, Ganesanandamoorthy P, Kull A, Scheidegger A, Stachler E, Boehm AB, Hughes B, Knudson A, Topol A, Wigginton KR, Wolfe MK, Kohn T, Ort C, Stadler T, Julian TR. 2022. Wastewater-based estimation of the effective reproductive number of SARS-CoV-2. Environ Health Perspect 130:57011.
9.
Rader B, Gertz A, Iuliano AD, Gilmer M, Wronski L, Astley CM, Sewalk K, Varrelman TJ, Cohen J, Parikh R, Reese HE, Reed C, Brownstein JS. 2022. Use of at-home COVID-19 tests-United States. Morb Mortal Wkly Rep 71:489–494.
10.
Yang J, Chen X, Deng X, Chen Z, Gong H, Yan H, Wu Q, Shi H, Lai S, Ajelli M, Viboud C, Yu PH. 2020. Disease burden and clinical severity of the first pandemic wave of COVID-19 in Wuhan, China. Nat Commun 11:5411.
11.
Bassi F, Arbia G, Falorsi PD. 2021. Observed and estimated prevalence of COVID-19 in Italy: how to estimate the total cases from medical Swabs data. Sci Total Environ 764:142799.
12.
Fenton NE, Neil M, Osman M, McLachlan S. 2020. COVID-19 infection and death rates: the need to incorporate causal explanations for the data and avoid bias in testing. J Risk Res 23:862–865.
13.
Böttcher L, D’Orsogna MR, Chou T. 2022. A statistical model of COVID-19 testing in populations: effects of sampling bias and testing errors. Philos Trans A Math Phys Eng Sci 380:20210121.
14.
Montesinos-López JC, Daza-Torres ML, García YE, Barboza LA, Sanchez F, Schmidt AJ, Pollock BH, Nuño M. 2021. The role of SARS-Cov-2 testing on hospitalizations in California. Life (Basel) 11:1336.
15.
Al Dallal A, AlDallal U, Al Dallal J. 2021. Positivity rate: an indicator for the spread of COVID-19. Curr Med Res Opin 37:2067–2076.
16.
Furuse Y, Ko YK, Ninomiya K, Suzuki M, Oshitani H. 2021. Relationship of test positivity rates with COVID-19 epidemic dynamics. Int J Environ Res Public Health 18:4655.
17.
Fenga L, Gaspari M. 2021. Predictive capacity of COVID-19 test positivity rate. Sensors (Basel) 21:2435.
18.
Dowdy D, D’Souza G. 2020. COVID-19 testing: understanding the "percent positive" Johns Hopkins Bloomberg School of Public Health. Available from: https://publichealth.jhu.edu/2020/covid-19-testing-understanding-the-percent-positive
19.
Fernandez-Cassi X, Scheidegger A, Bänziger C, Cariti F, Tuñas Corzon A, Ganesanandamoorthy P, Lemaitre JC, Ort C, Julian TR, Kohn T. 2021. Wastewater monitoring outperforms case numbers as a tool to track COVID-19 incidence dynamics when test positivity rates are high. Water Res 200:117252.
20.
Mercer TR, Salit M. 2021. Testing at scale during the COVID-19 pandemic. Nat Rev Genet 22:415–426.
21.
WHO. 2020. Public health criteria to adjust public health and social measures in the context of COVID-19: Annex to considerations in adjusting public health and social measures in the context of COVID-19, 12 May 2020. World Health Organization. https://apps.who.int/iris/handle/10665/332073.
22.
Daza-Torres ML, Montesinos-López JC, Kim M, Olson R, Bess CW, Rueda L, Susa M, Tucker L, García YE, Schmidt AJ, Naughton CC, Pollock BH, Shapiro K, Nuño M, Bischel HN. 2023. Model training periods impact estimation of COVID-19 incidence from wastewater viral loads. Sci Total Environ 858:159680.
23.
Zuccato E, Chiabrando C, Castiglioni S, Calamari D, Bagnati R, Schiarea S, Fanelli R. 2005. Cocaine in surface waters: a new evidence-based tool to monitor community drug abuse. Environ Health 4:1–7.
24.
Choi PM, Tscharke B, Samanipour S, Hall WD, Gartner CE, Mueller JF, Thomas KV, O’Brien JW. 2019. Social, demographic, and economic correlates of food and chemical consumption measured by wastewater-based epidemiology. Proc Natl Acad Sci U S A 116:21864–21873.
25.
Asghar H, Diop OM, Weldegebriel G, Malik F, Shetty S, El Bassioni L, Akande AO, Al Maamoun E, Zaidi S, Adeniji AJ, Burns CC, Deshpande J, Oberste MS, Lowther SA. 2014. Environmental surveillance for polioviruses in the global polio eradication initiative. J Infect Dis 210:S294–S303.
26.
Ai Y, Davis A, Jones D, Lemeshow S, Tu H, He F, Ru P, Pan X, Bohrerova Z, Lee J. 2021. Wastewater SARS-CoV-2 monitoring as a community-level COVID-19 trend tracker and variants in Ohio, United States. Sci Total Environ 801:149757.
27.
Xiao A, Wu F, Bushman M, Zhang J, Imakaev M, Chai PR, Duvallet C, Endo N, Erickson TB, Armas F, Arnold B, Chen H, Chandra F, Ghaeli N, Gu X, Hanage WP, Lee WL, Matus M, McElroy KA, Moniz K, Rhode SF, Thompson J, Alm EJ. 2022. Metrics to relate COVID-19 wastewater data to clinical testing dynamics. Water Res 212:118070.
28.
D’Aoust PM, Tian X, Towhid ST, Xiao A, Mercier E, Hegazy N, Jia J-J, Wan S, Kabir MP, Fang W, Fuzzen M, Hasing M, Yang MI, Sun J, Plaza-Diaz J, Zhang Z, Cowan A, Eid W, Stephenson S, Servos MR, Wade MJ, MacKenzie AE, Peng H, Edwards EA, Pang X-L, Alm EJ, Graber TE, Delatolla R. 2022. Wastewater to clinical case (WC) ratio of COVID-19 identifies insufficient clinical testing, onset of new variants of concern and population immunity in urban communities. Sci Total Environ 853:158547.
29.
Frampton D, Rampling T, Cross A, Bailey H, Heaney J, Byott M, Scott R, Sconza R, Price J, Margaritis M, Bergstrom M, Spyer MJ, Miralhes PB, Grant P, Kirk S, Valerio C, Mangera Z, Prabhahar T, Moreno-Cuesta J, Arulkumaran N, Singer M, Shin GY, Sanchez E, Paraskevopoulou SM, Pillay D, McKendry RA, Mirfenderesky M, Houlihan CF, Nastouli E. 2021. Genomic characteristics and clinical effect of the emergent SARS-CoV-2 B.1.1.7 lineage in London, UK: a whole-genome sequencing and hospital-based cohort study. Lancet Infect Dis 21:1246–1256.
30.
Graham MS, Sudre CH, May A, Antonelli M, Murray B, Varsavsky T, Kläser K, Canas LS, Molteni E, Modat M, Drew DA, Nguyen LH, Polidori L, Selvachandran S, Hu C, Capdevila J, COVID-19 Genomics UK (COG-UK) Consortium, Hammers A, Chan AT, Wolf J, Spector TD, Steves CJ, Ourselin S. 2021. Changes in symptomatology, reinfection, and transmissibility associated with the SARS-CoV-2 variant B.1.1.7: an ecological study. Lancet Public Health 6:e335–e345.
31.
Christie A, Brooks JT, Hicks LA, Sauber-Schatz EK, Yoder JS, Honein MA, CDC COVID-19 Response Team. 2021. Guidance for implementing COVID-19 prevention strategies in the context of varying community transmission levels and vaccination coverage. MMWR Morb Mortal Wkly Rep 70:1044–1047.
32.
Subramanian R, He Q, Pascual M. 2021. Quantifying asymptomatic infection and transmission of COVID-19 in New York City using observed cases, serology, and testing capacity. Proc Natl Acad Sci U S A 118:e2019716118.
33.
Daza-Torres ML, García YE, Schmidt AJ, Pollock BH, Sharpnack J, Nuño M. 2022. The impact of COVID-19 vaccination on California's return to normalcy. PLoS One 17:e0264195.
34.
SCAN. 2023. Sewer coronavirus alert network. Retrieved 8 June 2022.
35.
HDT. 2020. Healthy Davis Together. Available from: https://healthydavistogether.org/testing-data/. Retrieved 8 Jun 2022.
36.
Wolfe MK, Topol A, Knudson A, Simpson A, White B, Vugia DJ, Yu AT, Li L, Balliet M, Stoddard P, Han GS, Wigginton KR, Boehm AB. 2021. High-frequency, high-throughput quantification of SARS-CoV-2 RNA in wastewater settled solids at eight publicly owned treatment works in Northern California shows strong association with COVID-19 incidence. mSystems 6:e0082921.
37.
Topol A, Wolfe M, White B, Wigginton K, B B Boehm A. 2021. High throughput pre-analytical processing of wastewater settled solids for SARS-CoV-2 RNA analyses V2. ZappyLab, Inc.
38.
Topol A, Wolfe M, White B, Wigginton K, Boehm A BB. 2022. High throughput SARS-COV-2, PMMOV, and BcoV quantification in settled solids using digital RT-PCR, . In Protocols Io.
39.
Rothman JA, Loveless TB, Kapcia J III, Adams ED, Steele JA, Zimmer-Faust AG, Langlois K, Wanless D, Griffith M, Mao L, Chokry J, Griffith JF, Whiteson KL, Nojiri H. 2021. RNA viromics of southern California wastewater and detection of SARS-CoV-2 single-nucleotide variants. Appl Environ Microbiol 87:e01448–21.
40.
Symonds EM, Nguyen KH, Harwood VJ, Breitbart M. 2018. Pepper mild mottle virus: a plant pathogen with a greater purpose in (waste) water treatment development and public health management. Water Res 144:1–12.
41.
Hsu S-Y, Bayati M, Li C, Hsieh H-Y, Belenchia A, Klutts J, Zemmer SA, Reynolds M, Semkiw E, Johnson H-Y, Foley T, Wieberg CG, Wenzel J, Johnson MC, Lin C-H. 2022. Biomarkers selection for population normalization in SARS-CoV-2 wastewater-based epidemiology. Water Res 223:118985.
42.
Ferrari S, Cribari-Neto F. 2004. Beta regression for modelling rates and proportions. J Appl Stat 31:799–815.
43.
Christen JA, Fox C. 2010. A general purpose sampling algorithm for continuous distributions (the t-walk). Bayesian Anal 5:263–281.
44.
Cori A, Ferguson NM, Fraser C, Cauchemez S. 2013. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol 178:1505–1512.

Information & Contributors

Information

Published In

cover image mSystems
mSystems
Volume 8Number 431 August 2023
eLocator: e00018-23
Editor: Ileana M. Cristea, Princeton University, Princeton, New Jersey, USA
PubMed: 37489897

History

Received: 10 January 2023
Accepted: 1 May 2023
Published online: 25 July 2023

Keywords

  1. SARS-CoV-2
  2. COVID-19
  3. test positivity rate (TPR)
  4. wastewater-based epidemiology (WBE)
  5. effective reproductive number
  6. Bayesian sequential data assimilation

Data Availability

The codes implemented for the study are available in the GitHub repository. Analyses were carried out using Python version 3.9.

Contributors

Authors

J. Cricelio Montesinos-López https://orcid.org/0000-0001-5017-6192
Department of Public Health Sciences, University of California Davis, Davis, California, USA
Author Contributions: Conceptualization, Formal analysis, Methodology, Software, Visualization, Writing – original draft, and Writing – review and editing.
Department of Public Health Sciences, University of California Davis, Davis, California, USA
Author Contributions: Conceptualization, Formal analysis, Methodology, Software, Visualization, Writing – original draft, and Writing – review and editing.
Yury E. García
Department of Public Health Sciences, University of California Davis, Davis, California, USA
Author Contributions: Investigation and Writing – review and editing.
César Herrera
Department of Mathematics, Purdue University, West Lafayette, Indiana, USA
Author Contribution: Writing – review and editing.
C. Winston Bess
Department of Civil and Environmental Engineering, University of California Davis, Davis, California, USA
Author Contributions: Data curation, Funding acquisition, Methodology, Project administration, Writing – original draft, and Writing – review and editing.
Department of Civil and Environmental Engineering, University of California Davis, Davis, California, USA
Author Contributions: Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – original draft, and Writing – review and editing.
Miriam Nuño [email protected]
Department of Public Health Sciences, University of California Davis, Davis, California, USA
Author Contributions: Data curation, Methodology, Resources, Supervision, and Writing – review and editing.

Editor

Ileana M. Cristea
Editor
Princeton University, Princeton, New Jersey, USA

Notes

J. Cricelio Montesinos-López and Maria L. Daza-Torres contributed equally to this article. Author order was determined by alphabetical order.
The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy