A predictive model for Covid-19 spread – with application to eight US states and how to end the pandemic

Z. S. Khan; F. Van Bussel; F. Hussain

doi:10.1017/S0950268820002423

A predictive model for Covid-19 spread – with application to eight US states and how to end the pandemic

Published online by Cambridge University Press: 08 October 2020

Z. S. Khan ,

F. Van Bussel and

F. Hussain

Show author details

Z. S. Khan: Affiliation:
Department of Mechanical Engineering, Texas Tech University, 2703 7th Street, Box: 41021, Lubbock, TX79409, USA
F. Van Bussel: Affiliation:
Department of Mechanical Engineering, Texas Tech University, 2703 7th Street, Box: 41021, Lubbock, TX79409, USA
F. Hussain*: Affiliation:
Department of Mechanical Engineering, Texas Tech University, 2703 7th Street, Box: 41021, Lubbock, TX79409, USA
*: Author for correspondence: F. Hussain, E-mail: [email protected]

Article contents

Abstract
Introduction
A new model
Assumptions
Results
Discussion
Appendices
Data availability statement
References

Rights & Permissions

Abstract

A compartmental model is proposed to predict the coronavirus 2019 (Covid-19) spread. It considers: detected and undetected infected populations, social sequestration, release from sequestration, plus reinfection. This model, consisting of seven coupled equations, has eight coefficients which are evaluated by fitting data for eight US states that make up 43% of the US population. The evolution of Covid-19 is fairly similar among the states: variations in contact and undetected recovery rates remain below 5%; however, variations are larger in recovery rate, death rate, reinfection rate, sequestration adherence and release rate from sequestration. Projections based on the current situation indicate that Covid-19 will become endemic. If lockdowns had been kept in place, the number of deaths would most likely have been significantly lower in states that opened up. Additionally, we predict that decreasing contact rate by 10%, or increasing testing by approximately 15%, or doubling lockdown compliance (from the current ~15% to ~30%) will eradicate infections in Texas within a year. Extending our fits for all of the US states, we predict about 11 million total infections (including undetected), and 8 million cumulative confirmed cases by 1 November 2020.

Keywords

COVID-19

Type: Original Paper
Information: Epidemiology & Infection , Volume 148 , 2020 , e249

DOI: https://doi.org/10.1017/S0950268820002423 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright: Copyright © The Author(s), 2020. Published by Cambridge University Press

Introduction

The first cases of community coronavirus 2019 (Covid-19) transmission in the United States were reported in California, Oregon, Washington state and New York state in late February 2020 [Reference Schumaker1]. A Santa Clara, California death on 6 February was deemed the country's first Covid-19 fatality after an autopsy was conducted in April [Reference Schumaker1]. A national emergency was declared by US President Donald Trump on 13 March 2020, and testing several days later revealed that Covid-19 had spread to all 50 states [Reference Schumaker1]. On 20 March, New York City was declared the US outbreak epicentre [Reference Schumaker1]. A study, released in April 2020 as a preprint, found via genetic analysis of Covid-19 cases in New York City that the majority of the viruses originated in Europe – revealing that transmissions had begun as early as January from countries with no travel monitoring [Reference Gonzalez-Reiche2]. As of 29 June 2020, the US had 2 496 628 confirmed Covid-19 cases, and 125 318 Covid-19 deaths [3].

Covid-19 challenges faced by the US include fair allocation of adequate medical resources [Reference Emanuel4], minimising mortality, avoiding overwhelming the health-care system and keeping the effects of lockdown policies on the economy within manageable levels [Reference Anderson5]. Epidemiological analysis of the virus proliferation is needed to assess the impacts of mitigation strategies including social distancing, sheltering-in-place (voluntary) and quarantines (enforced by authorities) [Reference Anderson5], as well as personal habits including frequent hand washing and wearing masks. We have developed a new compartmental model, extending the long-standing SIR (Susceptible, Infected, Removed) model [Reference Brauer, Castillo-Chavez and Feng6–Reference Chang8], to evaluate and compare several states' responses to Covid-19; with this model we can make estimates, using curve fitting of reported data, of the impact of contact suppression measures and the lifting of such measures. We find that, for the current situation where several states have relaxed stay-at-home measures, Covid-19 will become an endemic virus for at least two years; so it is not surprising that some states are reinstating the stay-at-home measures [Reference Lee9]. More in-depth projections (made by varying contact rates, detection rates and sequestration adherence), using Texas as a test case, suggest that a modest increase in the testing rate, a modest decrease in the contact rate or a significant increase in lockdown compliance could eradicate the virus within a year.

A new model

Our SQUIDER model incorporates additional processes into the classic SIR model: (i) making a distinction between known cases (which are publicly reported) and asymptomatic or mild cases which are not monitored or detected; (ii) including the effects of responses, with varying adherence by region, to the pandemic, whether direct, through quarantine or medical isolation of diagnosed cases, or less direct such as social distancing efforts and (iii) possible loss of immunity of recovered individuals, allowing some of them to be reintroduced into the Susceptible population. The model thus requires several new compartments, which we will denote as U (Undetected infected), E (undetected recovered) and Q (pseudo-Quarantine, a bin to hold a segment of the susceptible and undetected infected populations allowing us to model reduced human interactions due to social distancing). Furthermore, for modelling/fitting purposes we add a separate compartment D for known infecteds who die; while undetected infections, undetected deaths, undetected recoveries and quarantine adherence are not available, deaths from the virus are generally reported [10]. Figure 1 shows the connections among different compartments in our model.

Fig. 1. Schematic of the compartments, with the rates of transfer between the compartments.

The rate equations are as follows:

(1)$$\matrix{ {\displaystyle{{{\rm d}S} \over {{\rm d}t}} = {-}\beta SU^a-q\lpar t\rpar S + \rho \lpar E + R\rpar } \cr } $$

(2)$$\matrix{ {\displaystyle{{{\rm d}U} \over {{\rm d}t}} = \beta SU^a-\lpar q\lpar t\rpar + \epsilon + \delta \rpar U} \cr } $$

(3)$$\matrix{ {\displaystyle{{{\rm d}I} \over {{\rm d}t}} = \delta U-\lpar \gamma + \alpha \rpar I} \cr } $$

(4)$$\matrix{ {\displaystyle{{{\rm d}R} \over {{\rm d}t}} = \alpha I-\rho R} \cr } $$

(5)$$\matrix{ {\displaystyle{{{\rm d}D} \over {{\rm d}t}} = \gamma I} \cr } $$

(6)$$\matrix{ {\displaystyle{{{\rm d}Q} \over {{\rm d}t}} = q\lpar t\rpar \lpar U + S\rpar } \cr } $$

(7)$$\matrix{ {\displaystyle{{{\rm d}E} \over {{\rm d}t}} = \epsilon U-\rho E} \cr } $$

Each compartment is normalised by the total population N; hence

(8)$$S\, + \,Q\, + \,U\, + \,I\, + \,D\, + \,E\, + \,R\, = \,1.$$

Note that the coefficients α, β, δ, ε, γ and ρ are constants (to be evaluated from fits to data). Being normalised, our compartment variables are non-dimensional, and our rate coefficients have units of days to the power of −1, so the model could be described as a one-dimensional continuous dynamical system.

Before we go through the individual equations we should discuss some of the recurring terms and factors. First, the incidence rate βSU^a, the average normalised new infections in time, is nonlinear when a ≠ 1. Here β is the contact rate, which is the average number of contacts a person has per day, multiplied by the probability of transmitting the disease when contact between a Susceptible and an Undetected infected occurs, i.e. the level of contagiousness. Detected Infecteds (I) are not involved since we assume that, post-diagnosis, the I group are generally in medical isolation or some other form of quarantine [Reference Parmet and Sinha11]. If a = 1, this term describes homogenous mixing of the Susceptible and Undetected infected populations [Reference Liu and Stechlinski12], which may not be accurate for states with isolated populations, low population densities, or many densely populated areas; the relationship may be sublinear or superlinear depending on the population being sparse or dense. Power law incidence rates (such as βSU^a) have been shown to improve the accuracy of SIR models [Reference Novozhilov13, Reference Roy and Pascual14].

Second, the factor q(t) models the time dependence of social distancing and contact suppression, as well as the effects of subsequent lockdown release. Social distancing is modelled by transferring a proportion of the Susceptible and Undetetected infected populations at a time t ₁ into the Q compartment. This does not imply that some large number of undiagnosed people are put into any actual physical quarantine, only that the available sub-groups for infecting (U) and for becoming infected (S) are reduced; alternatively, this might possibly be modelled by altering β or the power law dependency a in a time-dependent way. Mathematically, this transfer is realised within the ODE model by having q(t) be time-dependent – that is 0 until close to the time of sequestration; it then becomes large very quickly and then subsides quickly to 0 again; in other words, q(t) is a pulse. This pulse transfers populations between compartments – for example, from S and U to Q at the start of a lockdown, and from Q to S when lockdown is released. Note that the transferred population will stay in the Q compartment until a subsequent, negatively weighted q ₂(t) pulse is generated. This pulse form was chosen, as opposed to a constant value used by some authors [Reference Xu15–Reference Maier and Brockmann17], because many states went into lockdown on a particular day with some, but not all, people self-isolating [Reference Gostin, Hodge and Wiley18, Reference Mervosh, Lu and Swales19]. In particular, the day t ₁ where such measures take effect and the maximum pulse magnitude q ₁ are of interest because they reflect the compliance (or lack thereof) of the state's population. A constant rate of sequestration, on the other hand, is not only somewhat unnatural in terms of social dynamics, but will result in the entire population eventually being sequestered unless there is some opposite movement taking people out of sequestration as well.

Equation 1 for the Susceptible population is reduced nonlinearly by new infections βSU^a due to interactions, and explicitly reduced in a time-dependent way by sequestration q(t)S (not by any significant amount until we get close to the activation time t ₁), as well as increased again by re-entrance at a rate ρ by the undetected recovered (E) and Recovered (R) groups. This term was added due to the WHO revelation that recovered Covid-19 patients may have little or waning immunity after exposure [20], later confirmed by an antibody study conducted on individuals who had recovered from Covid-19 infections [Reference Long21].

Equation 2, for Undetected infecteds, includes the increase due to contact with S members, and removal by various causes. The rate ε corresponds to the recovery rate in the basic SIR model. The detection rate δ specifies the proportion of Undetected infected individuals who are diagnosed with the virus (and are hence no longer undetected); it is added to the I compartment. Finally, this population is also effectively reduced due to social distancing q(t), e.g. residents of many states were encouraged to shelter at home and not seek testing/diagnosis unless they became symptomatic, in order to ease pressure on medical resources [22].

Equation 3 for the detected Infected population, describes increases due to testing at rate δ and decreases due to death with rate γ, and recovery with rate α. Since these individuals are isolated in designated hospital wards or under quarantine at home [Reference Parmet and Sinha11], hence unlikely to be a source of infection to the community at large, we felt there was no need for sequestration (i.e. q(t)) when social distancing went into effect. This is in agreement with the WHO guidance on quarantines segregating suspected exposed people [20].

Equation 4 describes the growth of detected Recovered, balanced by outflux of the Recovered population into the susceptible compartment at rate ρ due to little or waning immunity, expected for human coronaviruses [Reference Callow23]. Equation 5 describes the increase in Deceased detected individuals. Equation 6 describes the increase in the pseudo-Quarantine compartment due to official contact suppression measures q(t), which as stated above is only significant around the activation times t ₁ and t ₂. Equation 7 describes increases in the undetected recovered population at rate ε, and decreases in this population due to loss of immunity of E at rate ρ.

Assumptions

Several simplifying assumptions or idealisations have been made. To begin with, in our model the detected Infecteds (I) do not transmit the disease to the Susceptible (S) population. It is generally the case that in all such disease outbreaks (e.g. the 2014–2016 Ebola outbreak), even when strict quarantine measures are in place, medical service providers and other people rendering direct aid to victims are themselves vulnerable to infection; when the outbreak (in the non-healthcare worker population) is contained they may even make up the substantial proportion of cases [24]. However, in the current situation where the disease circulates through the general population and safety protocols are rigorously enforced by frontline health providers [Reference Ng25], the proportion of such cases relative to the general population is negligible.

Other simplifying assumptions: as mentioned above, we have opted to keep our contact rate β constant and instead vary the S and U compartment population levels to mimic social distancing effects (for example staying at home). In future versions of our model we may incorporate time-dependent β or a in order to disentangle population-wide transmission suppression (e.g. face masks and other protective gear for the public) from social contact suppression (cancellation of concerts and other public gatherings), but for the sake of simplicity in both coding and analysis we have opted for now to use only one time-dependent rate. In the same vein, the q(t) function removes Susceptibles and Undetected infecteds at the same rate (we have no reason at this time to differentiate the rates of change of these populations).

Similarly, people from the E and R compartments lose immunity at the same rate ρ. Indeed, it is possible that people who experienced milder forms of the disease (E) lose immunity faster than those who experienced more severe symptoms and sought out treatment (R) [Reference Long21]; however, we use a single rate ρ for simplicity. As other authors [Reference Moghadas16, Reference Kennedy26, Reference Li27], we did not consider our undetected recovery rate ε to be equivalent to α since it is possible that recovery rates are different for people with mild or no symptoms who do not seek medical care in comparison with people who do seek medical care (the I population). We additionally do not expect such mild or asymptomatic cases to be fatal [Reference Basu28]; therefore, people are not removed from the U compartment due to death. Note that, at least in the early days of the pandemic, increased overall deaths in comparison with the prior three years were not examined for signs that the virus was active among undiagnosed populations [Reference Weinberger29]. Finally, we do not consider the effects of births, vertical transmissions, immigrants, emigrants or deaths due to other diseases or trauma. The inclusion of deaths due to diagnosed virus cases makes the model not entirely static, but the disease's total deaths as a proportion of the total population is low enough that births and other such aspects can be safely omitted.

Results

Coefficient evaluation

Figure 2 shows fits of the model to data (cumulative confirmed case counts and deaths due to Covid-19) for Arizona, California, Florida, Illinois, Louisiana, New Jersey, New York State and Texas. The fits to cumulative case counts are all excellent – even for states such as Illinois, New Jersey and New York that have a distinct inflection in case counts. It is also apparent that some states, such as Arizona, California, Florida, Louisiana and Texas, have had rapidly rising case counts in June – likely due to easing of restrictions. This feature is captured by our q(t) – where application of a second, possibly negative pulse moves people from the Q to the S compartments as described above. Note that model fits to cumulative death counts deviate from the data starting in mid-May in some states (Arizona, California, Florida, Louisiana, New Jersey, New York and Texas). This may be due to improved medical treatments (such as dexamethasone [Reference Cain and Cidlowski30]), virus mutations resulting in a less deadly strain, or the postulated lack of reliability in confirmation of US Covid-19 deaths, which may be significantly undercounted [Reference Bump31]. Fit parameters are listed in Table 1. The contact and exclusion rates vary by less than 5%; however, reintroduction rate varies by 320%, detected recovery rate by 65%, re-release rate by 268%, stay-at-home effect by 103% and death rate by 59%. These variations, though large, are not surprising due to different states having been in different stages of the outbreak. For example, New York's and New Jersey's recovery rates are significantly lower than other states'. This could be due to the fact that these states had saturated hospital capacities during their peak outbreak [Reference Rothfield32, Reference Duffy33], causing slower recoveries due to decreased access to medical staff and treatments. Figure 3 additionally shows the dynamics of the U, E and R, as well as I and D (fitted compartments) over the fit period (22 January to 29 June 2020). Note that the U curves follow the I curves because individuals are transferred from U to I at a constant rate δ and out of I at a constant rate α + γ. In this figure, it appears that U and E cases in New York and New Jersey have stabilised, similar to the I + R cases in Figure 2. Presumably this is due to the outbreak occurring earlier in those states, and possibly because the official response has been more rigorous.

Fig. 2. SQUIDER model fits. Fits of our compartment model to recorded data on confirmed cumulative case counts and deaths; all fits have R ² ≥ 0.996. Data were obtained from The Johns Hopkins University [34]. The vertical dotted line indicates the last date fitting data was obtained for.

Fig. 3. Computed results for the other compartments for different states. Current unknown infected (U), unknown recovered (E), current detected Infected (I), detected Recovered (R) and detected Deaths (D).

Table 1. SQUIDER model fit parameters for selected US states

All rate parameters have units of days⁻¹, times have units of days, positive q_i values denote to proportions of the S and U compartments, negative q₂ values denote to proportions of the Q compartment and the initial condition U ₀ × N is given in units of individuals.

We compare our fit parameters with the classical SIR model which, to remind the reader, involves only Susceptible, Infected and Recovered compartments. The β and parameters correspond to the contact and recovery rates of the SIR model; the fit values imply that for unconstrained epidemic situations (with q(t) = δ = 0) the disease has a reproduction number R ₀ = β/ε of around 5. The contact and recovery rates we find are consistent with several prior investigations [Reference Maier and Brockmann17, Reference Liu35]. See Table 2; for further discussion of how the basic SIR model fares in comparison with SQUIDER, see the appendices.

Table 2. Basic R ₀ and effective R_t reproduction number values for selected US states

It is surprising that the detection rate δ is so high for all of the states (≈0.5); however, a recent nationwide coronavirus antibody study by the Spanish Health Ministry [Reference Jones36] suggests that the number of unknown infected and unknown recovered in large and heterogeneous jurisdictions, while significant, is not orders of magnitude larger than the number of confirmed cases. This goes against some prior speculation that the asymptomatic and undiagnosed cases might be as much as 10 times the official count [Reference Gaeta37], which would suggest a detection rate 5 times smaller than our δ values.

Returning back to Figure 2 and Table 1, the death rate from diagnosed cases γ falls between around 2% and 3.5%, which is well within the quite wide range of case-fatality rates reported for earlier phases of the pandemic [Reference Roser38]. The recovery rate α for diagnosed cases, seems somewhat high (in the 0.45–0.5 range for New York and New Jersey, and around 0.7–0.8 for the other states); this could reflect the fact that detection of any disease would normally occur after that disease has already partly run its course, but it should be kept in mind as well that this compartment has a minimal effect on the size of the fitted compartments I and D, so the fitting routine may not be as constrained in selecting the α value as the other fit parameters. The re-entry (due to loss of immunity) rate ρ is fairly low for most of the states, which indicates that this is not a significant factor for the initiation of the outbreak. Such low ρ values are not unreasonable since loss of immunity to corona viruses that cause common colds is typically slow, taking several months in some individuals, up to a year in others [Reference Callow23]. Indeed, one recent report found that antibodies in a high proportion of individuals who recovered from Covid-19 started to decrease by about 10% within 2–3 months after infection, suggesting a gradual loss of immunity [Reference Long21]. Surprisingly, Louisiana has a much larger ρ value than other states (≈0.17). This was selected by the fitting routine to account for the unexpected rising new cases during that states' lockdown period. As we see below, the reintroduction of people to the Susceptible compartment obviously results in an endemic infection in the predictions.

Our fitted peak initial sequestration values range over 0.1–0.2 for all states except Illinois (whose value is close to 0.04). The parameter t ₁ enables prediction of what day self-isolation policies started having significant effects on case counts – see Table 1. To compare t ₁ to states' directives, stay-at-home orders were issued in Arizona on 31 March, California on 19 March, Florida on 1 April, Illinois on 21 March, Louisiana on 23 March, New Jersey on 21 March, New York on 22 March, but was suggested in Texas on 2 April [Reference Mervosh, Lee and Popovich39]. There is a one week or so time lag between announced state action and its measured effect, which may or may not have a physiological basis [Reference Bartels and Achen40]. It is possible that Texas, Arizona and Florida residents were following local orders which prescribed sheltering in place sooner than the state orders: as examples Dallas had a state of emergency declared on 19 March, and Houston residents were urged to stay at home on 24 March [Reference Debenedetto and Watkins41].

Many states partially re-opened in May. Our fit parameter q ₂ for the various states corresponds to the percentage of people moving between the Susceptible and Quarantine compartments – where a negative number indicates the percentage decreasing, moving from the Quarantine to the Susceptible compartment. Specifically, Arizona, Florida and Texas had significant numbers (>30%) exiting from stay-at-home conditions, whereas California and Louisiana had smaller numbers (<11%), Illinois had 4.5% additional people enter the Q compartment and New Jersey had a small Q entry (<0.5%). The date at which this occurred, t ₂, can also be compared with the dates stay-at-home orders were lifted or expired – also in Table 1. Stay-at-home orders were relaxed or expired on 15 May in Arizona, low-risk businesses and some restaurants opened in California on 8 May; stay-at-home expired in Florida on 4 May, in Louisiana on 15 May, in New York on 28 May, and in Texas on 30 April. New Jersey, on the other hand, did not officially relax social distancing measures within the time range of our data. The time lag between t ₂ and official dates may correspond to the end of the school year and people's perception of safety.

The initial values of the undetected infecteds U(0) are all less than one individual (some significantly), implying that none of the states we look at had any actual cases on 22 January (the first day for which we have data). While the ODE results can be scanned to find an estimate for the arrival of the first case (or first two, or five cases) in a jurisdiction, some caution should be exercised in applying this number, since magnitudes at this point are still too small to make statistically valid comparisons. Given that the model estimates there to have been at least 10 cases in the states studied (except Arizona, Florida and Illinois) by the 3rd or 4th week of February (the 1st week of February for New York) we think it is probable that Covid-19 was spreading considerably sooner in New York, New Jersey, Texas and Louisiana than previously assumed. This implies that stronger measures – such as travel bans, cluster identification, contact tracing and quarantine measures – were needed to fully contain the outbreak [Reference Pinotti42, Reference Chinazzi43]. California had reported cases already in late January, yet both the reported data and our ODE model show the main outbreak occurred well after New York or New Jersey. This is likely if the western states were dealing successfully with cases coming directly from Asia, but lost control of the outbreak when infected individuals started arriving from the eastern US or possibly Europe; possible differences between the Asian and European Covid-19 strains are not addressed here.

Model predictions

We have generated predictions using the current fits for two years beyond the first date of recorded US cases. Figure 4a–h show that the total number of infections increases substantially in most states from July 2020 to January 2021, except for Illinois, which apparently experienced its peak case count in May. It is predicted, however, that all states will have continued Covid-19 infections for the next two years, some with small secondary peaks occurring in the spring of 2021. California, Illinois and New Jersey do not have secondary peaks – this is likely due to these states having positive q ₂ values (meaning more people enter Q than leave), small negative q ₂ values or very small ρ values. Louisiana also does not have secondary peaks – most likely due to this state's large ρ value where reintroduction of sufficient individuals into the susceptible population damps out oscillations in the infected population [Reference Hethcote, Stech and van den Driessche44]. For oscillations to be present in compartment models, cycling of populations has to occur at an intermediate rate – having a high re-entry rate leads to steady infections; and having no re-entry results in eradication of the virus. Our projected daily deaths (Fig. 4i) show that Arizona, Florida, New York and Texas have secondary peaks in deaths after the first main peak. New Jersey and Illinois avoid a secondary peak, presumably because these states haven't yet reopened. We have performed fits to all of the states in the US and predict 11 326 089 cumulative cases, and 8 346 433 cumulative confirmed cases (for all fits R ² ≥ 0.96), assuming no further interventions (such as additional lockdowns).

Fig. 4. Model predictions. Total infected (U + I), (I) and confirmed daily deaths (D) for two years beyond the first day of recorded infections generated from our model fits, and the counterfactual scenario of not having lifted stay-at-home orders. The left vertical dashed line indicates the day ‘shelter-in-place’ orders were implemented, and the right vertical dashed line marks the day where such orders were lifted. (I) Confirmed daily death counts for each state for model fits. (J) Confirmed daily death counts for the counterfactual scenario of not lifting orders.

We have also generated counterfactual estimates of case counts (Fig. 4a–h) for the hypothetical situation of sustained stay-at-home orders, i.e. hence disallowing transfer between Q and S. The daily peak total cases is ≈10 times lower for Arizona, Florida, Louisiana, New York and Texas, in comparison with trends predicted from current data. Keeping the stay-at-home orders had weaker effects in Louisiana and California than the other states – due to only releasing small numbers of their Q population back to S (≈10% for Louisiana, <1% for California). Our counterfactual daily deaths (Fig. 4j) show that maintaining staying-at-home could have significantly reduced deaths in Arizona, Florida, Louisiana, New York and Texas. California was not strongly affected because its residents did not fully re-open, and Illinois and New Jersey's counterfactual and factual projections do not differ since these states did not reopen.

Sensitivity to intervention level

Given our grim prediction (Fig. 4), it is natural to ask if there is some non-pharmaceutical intervention (i.e. something other than vaccines) that could improve the situation. We show the effects of actions such as mandating mask-wearing in public by reducing the contact rate β in Figure 5a for Texas. Decreasing β by 10% results in virtual eradication of Covid-19 in Texas within one year. Increasing the detection rate (i.e. test-and-trace) by approximately 15% will also eradicate the virus within a year, shown in Figure 5b, as will also doubling lockdown compliance q ₁. These show that β is the most sensitive parameter with respect to reduction of infection rates, though the most practical approach in terms of trade-offs and compliance is simply for governments to increase testing.

Fig. 5. Predictions for increasing the effect of non-pharmaceutical interventions for Texas. Total (U + I) case counts for: (a) Decreasing the contact rate β, (b) increasing the detection date δ and (c) increasing the sequestration compliance via quarantine rate q ₁.

Discussion

Several studies have already modelled the growth of Covid-19 infections and deaths in the US or its various states. As expected for nonlinear systems, some predictions, even if accurate for a short time, can deviate significantly with increased time. A SEIR model (Susceptible, Exposed, Infected, Removed) was implemented on a network to simulate inter-state travel [Reference Peirlinck45]. They predict that, in the absence of countermeasures, the outbreak peaks on day 54 in their simulation (10 May 2020). Other SEIR-type models with additional compartments [Reference Xu15, Reference Moghadas16] including quarantine, also predict that the US outbreak peaks near 10 May 2020 [Reference Xu15], or peaks in the general population 15 weeks into the outbreak (approximately by the last week of April) if only 5% of the population practices self-isolation within a day of symptom onset [Reference Moghadas16]. A logistic model of Covid-19 growth in the US predicts that the cumulative number of cases plateaus by 14 May 2020 [Reference Kriston46]. Alternatively, a neural network parametric model was developed, which predicted that the US would reach the peak number of cases by 8 April 2020 [Reference Uhlig47]. Additionally, a sigmoidal Hill-type model predicts that the US will have 735 920 cases within 76 days of the outbreak, with 41 285 deaths [Reference Aboelkassem48].

More recent Covid-19 models make more dire predictions for the US. A simple SIRD (where D denotes deaths) model [Reference Al-Raeei49], fit to data up to 30 May 2020 predicts that there will be 3.8 million infected and 244 420 deaths by 1 September 2020. A SEIR-type model with additional compartments including unsusceptible (to take into account social distancing), hospitalised and critical populations was proposed by Kennedy et al. [Reference Kennedy26]. They took into account social distancing by removing individuals between the susceptible and unsusceptible compartments at a time-dependent rate. They predict that, for a relaxed social distancing scenario where 40% of the US population is unsusceptible and fitting to data up to 4 May 2020, there will be 60 million infections and 750 000 deaths by December 2020. Li et al. [Reference Li27] have developed a new compartmental model (named DELPHI) based on the SEIR model that also considers additional compartments – undetected, hospitalised and quarantined. Government interventions are taken into account with a time-dependent contact rate. Fitting to data up to 19 May 2020, the model predicts approximately 213 million Covid-19 cases by 15 July 2020, with restrictions on mass gatherings, travel and work.

Zou et al. [Reference Zou50] developed a SEIR model that takes unreported/untested cases into account (named SuEIR). This model, combined with machine learning on data from 22 March to 3 May and data validation between 4 May to 10 May 2020, predicts that Covid-19 infections would have peaked on 1 June 2020, and that there would have been 123 400 total deaths by 30 June 2020. In contrast, after we did our modelling, the IHME Covid-19 Forecasting Team [51] also used a compartmental (SEIR) model with inhomogeneous population mixing (as also ours) to test the impact of non-pharmaceutical interventions on infections and deaths, where changes in population mobility, mask use and social distancing mandates were captured with a time-dependent contact rate. They predict that there will be 430 000 cumulative deaths by 31 December 2020 if social distancing measures are removed, 295 000 deaths if social distancing mandates are imposed when 8 deaths per million residents is surpassed in each state, and 192 000 deaths for a scenario where 95% of the population wears masks and social distancing mandates are imposed at 8 deaths per million. In contrast, our model takes into account all of the critical parameters: undetected infected, possible loss of immunity, sequestration measures, finite detection rates and inhomogeneous mixing between the undetected infected and susceptible populations.

Our model predicts significantly more Covid-19 cases and deaths, with an extended duration past 2 years for the majority of states examined. We aim to extend our predictions to include mask use by incorporating a time-dependent β.

A Covid-19 SEIRS model (where recovered become susceptible again) including co-infection with additional human coronavirus strains and a periodic basic reproduction number R₀ corresponding to seasonal forcing, combined with US data, predicts that wintertime outbreaks will occur for several years if immunity wanes – as also occurs with other coronaviruses [Reference Kissler52]. This study also predicts that the number of confirmed Covid-19 cases in the first wave strongly depends on the peak value of R₀. Furthermore, social distancing was tested by reducing R₀; applied once this may push the epidemic peak to the autumn, whereas intermittent application can reduce the total number of cases [Reference Kissler52].

Delays in transfers between compartments (such as our pseudo-quarantine), and in transferring between several compartments (effectively causing a delay) prior to re-entry in the susceptible population are known to cause oscillations in SIR-type models [Reference Hethcote, Stech and van den Driessche44] – reintroducing individuals into the susceptible compartment de-stabilises the steady rate of infections. Temporary immunity, modelled by our reintroduction of recovered and undetected recovered populations into the susceptible compartment, as well as relaxation of shelter-in-place orders, can produce yearly oscillations such as found in influenza and other human corona viruses (i.e. ‘common colds’) [Reference Callow23, Reference Kyrychko and Blyuss53]. It is possible that without a yearly vaccination program Covid-19 will become endemic in the United States with annual spikes in cases.

In conclusion, we have developed a compartment model taking into account social distancing, undetected infecteds and possible loss of immunity – all issues which are relevant for Covid-19. The model describes current data very well for the states selected for study; this more realistic picture of the disease growth is likely due to both using a larger number of compartments than traditional SIR-type models, and to considering additional nonlinearity in the infectious power of the disease. While projections based on the model are not wholly optimistic, they do point to the fact that it is quite possible to avoid more severe outcomes with stronger measures – increased detection, mask mandates and strict stay-at-home adherence – than have been pursued so far.

Appendices

Methods

All numerical simulation for equations 1–8, fits and data management were done in Matlab. Data for cumulative confirmed cases and deaths were obtained from the Johns Hopkins University (JHU) Center for Systems Science and Engineering, which has been making highly credible US and global Covid-19 time-series statistics available to the public on the GitHub [34] website. Raw data in the original CSV files were converted to Matlab table data structures for ease of access; since the US data were broken out by municipality/county, it was necessary to aggregate this to create each state-wide time series. 2019 estimates of state populations used for normalisation were acquired from the US Census Bureau [54].

Least-squares fits of numerically generated curves to the data were obtained with Matlab's lsqcurvefit using a trust-region-reflective algorithm; this is a variant of the conjugate gradient method designed for large-scale bound-constrained minimisation problems [55]. Hence one of the benefits of this fitting routine is that fit parameters can be given bounds or fixed values (the latter being especially useful during model development and testing). Fit parameters were: all model rate parameters (β, ε, δ, α, γ and ρ), power law exponent a, initial condition for unknown infecteds U(0), plus two pairs of parameters governing sequestration of populations due to social distancing – peak q_i values and dates of application t_i where i = 1,2. Fits were done in two stages. When work was begun on this project, we had data from 22 January to 9 May, which allowed us to make initial fits for all rate parameters, infectious power, initial conditions and lockdown effects. In the course of writing the results up, we realised we would need to incorporate the effect of breaking developments (cessation of state stay-at-home orders and new spikes in cases), so a second fit was done to determine the time and magnitude of the release of lockdown (q ₂, t ₂) keeping all of the previous parameter values unchanged.

The algorithm used by lsqcurvefit, like other gradient methods, iteratively navigates through a series of successively better solutions until it finds a local minimum, determined mainly by detection of apparent convergence of the target value (in this case, squared error). Since all the rates are bound between 0 and 1, the t_i parameters were rescaled by the total time of the simulation to fall within the same range (this helps the fitting routine when determining step sizes while revising the current solution). Fits were made comparing certain selected and aggregated simulation results against normalised JHU data for cumulative confirmed cases and deaths, simultaneously. lsqcurvefit default values were used for tolerances, iterations and step size, but because the proportions of state populations were so small simulation results and normalised data were rescaled to increase the magnitude of the error, preventing the fit routine from prematurely settling for a solution (the scaling formula was chosen so that the norm of the rescaled test data was equal to the number of elements in the test data matrix).

Simulation results were generated using Matlab's ode45 function, which uses an explicit Runge−Kutta (4, 5) formula. Like almost all numerical ODE solvers, these work iteratively by extending a known value y_t to t + Δt by evaluating the derivatives of y at t; RK methods use a sophisticated weighting scheme to correct for the deviations that accumulate using linear extrapolations to estimate a nonlinear function. Default settings were used for the solver, except that the maximum step size was constrained to be ≤ 0.5 days (this prevents the solver, which uses an adaptive step size, from accidentally stepping over the sequestration date t ₁). The implementation of the model itself, coded in a function that is given as an argument to ode45, is for the most part straightforward; the only aspect that requires any further comment is the handling of the sequestration function q(t).

As mentioned in the model description, the effect of stay-at-home orders and other social distancing measures is modelled by shifting a segment of the susceptible and unknown infected population into the Q bin, where they cannot be infected or infect others, as the case may be. The interface of ode45 requires that the user supply a subroutine to evaluate the derivatives of the target function at time t, but does not allow direct manipulation of the target function values themselves in mid-run; as well, the user has no way to force the solver to do a derivative evaluation at any particular time. Since the system of ODEs works by transferring populations at various rates between compartments, we resolve these issues by using a timed pulse, i.e. a rate which is generally 0 or close to 0, but which may change any time the solver calls the model subroutine, being relatively large close to the set activation time. For this we use the value of a Gaussian curve at x = t with mean value t_i, where the height and width are set so as to achieve the sequestration we desire (easily calculated using Matlab's normpdf function). Since ode45's adaptive step size decreases when it detects unexpected movement in the q rate, a well resolved sampling of different values near the peak t_i is obtained. To make analysis more straightforward, we decided that the q _i values given to the solver would be equal to the total sequestration effect (e.g. if we set q _i to 0.15, then roughly 15% of the S and U compartments would be moved into the Q compartment within a day or two of t_i). To achieve this, for sequestrations ≤0.625 we normalised the peak rate to the same value, and set the standard deviation to a value between 0.4 and 0.625 determined by trial and error and fit to a cubic polynomial. For sequestrations above 0.625, a much more complicated formula was needed to achieve the desired effect; since such high sequestration never appeared in any of the fits we will omit any further discussion of this, except to say that to get effective clearance (99.84%) of the entire relevant compartment(s) we use a Gaussian with both height and width set to ≈1.6. See Figure 6 for an example of how the S and Q compartments change over time due to the action of the q pulses.

Fig. 6. Computed results for all the compartments (example: New York State). Demonstrating the effect of the ODE model's q pulses timed for 2 April and 29 May on the S (blue) and Q (green) population dynamics. q_i values ≈20% and −25%, respectively. Due to the y-axis scale some compartments are partially occluded.

Lastly, for the purpose of doing counterfactual and hypothetical projections, the model implementation subroutine accepts an arbitrary number of peak rate/activation date pairs tacked onto the end of the parameter vector it takes as one of its arguments. This allows us to test the effect of doing several interventions of possibly different magnitudes. Also, a negative peak rate is implemented as returning the specified proportion of the sequestered population in Q back to S (since ode45 has no facility for keeping track of the ratio of S and U populations that were originally sequestered, it was felt that returning everyone to S was the most sensible approach). To make this feasible codewise, at any particular time t only the q_i with peak time t ^∗_i closest to t is executed. In practice, if peaks are set too close to each other (e.g. within half a week or so) they may interfere with each other's ability to achieve full sequestration or release; but since this is essentially the case in real life as well we thought it not to be a priority to address this issue.

The basic SIR model and SQUIDER

As mentioned in the model description, a basic SIR model underlies the SQUIDER model, so the SIR dynamics given the measured contact and recovery rates and estimated initial condition can be computed by setting all other rate parameters to 0 and keeping the power law exponent a = 1. This can be considered a counterfactual case where no interventions of any kind (medical or social) were attempted. Figure 7a compares the result with the empirical cumulative case count. The plotted dynamics show the inevitable SIR dynamics when the basic reproduction number R ₀ > 1 (for our New York fit it was ≈5.5), with the S compartment decreasing monotonically from 1 to 0, I compartment peaking, and the R compartment increasing monotonically from 0 to 1.

Fig. 7. Applicability of basic SIR model to COVID dynamics (example: New York State). (a) Dynamics of underlying SIR model in SQUIDER fit of NY data (i.e. using only β and values plus initial condition U ₀ in the ODE simulation). (b) Attempted independent fit of basic SIR model to empirical data (cumulative cases = combined infected and recovered).

Is it possible to fit the basic SIR model to the available data with good results? Figure 7b shows one such attempt (again, the example is New York). One can see that the fitting routine has placed its relatively uninflected curve between the various inflections of the empirical data (so that, in fact, the signed errors mostly cancel each other), but is incapable of actually capturing the shape. In this case, for the simple SIR model the error measured by the fitting routine (norm of the residuals) was more than 10 times that of the SQUIDER fit to the same data. The R₀ for this fit is 1.018, which matches well for the effective R obtained for New York and other states once they started taking measures against the virus (see next section). As mentioned in the previous section, which of the often multiple local minima the fitting routine finds is largely dependent on the starting guess for the parameter set; the one shown is definitely the best of the several attempts we made, but we admit we did not try all possible permutations (nor is it necessary). One work [Reference Al-Raeei49] made predictions using fits of a SIRD model (SIR with a separate Deaths compartment, similar to our detected deaths) to data from various countries. Their fits were obtained by a deterministic method that takes advantage of the fact that the basic SIR ODE model has a closed-form solution. However, the values they obtain for the contact and recovery rates are extremely small (on the order of 10⁻⁷), so seem to lose all basis in whatever physical/social meanings the rates have. It should be noted that there are no plots in the paper showing fits against empirical data, and we were not able to make our own since no initial values are given.

The effective reproduction number R_t

In epidemiology, R₀ is ‘the basic reproduction number of a disease’ and denotes the expected number of cases produced by a single infected individual in a completely susceptible population. This number, extensively used in epidemiological modelling, describes whether an epidemic breaks out or not; if the value is less than 1 an outbreak does not result in an epidemic, whereas if it is larger than 1 an epidemic occurs [Reference van den Driessche56]. If R₀ is much larger than 1, then the outbreak will be stronger and faster. In simple SIR models, it is a direct consequence of, indeed the direct product of three factors: transmissibility (a probability or likelihood of becoming infected), the average number of contacts a person has per day and duration of infectiousness (time to recover, 1/α), namely β/α. Our model is indeed more complex than the traditional, prototypical SIR model (due to its having seven distinct, identified compartments); therefore, the reproduction number R₀ necessarily involves additional factors such as q(t), ρ, ε and δ, as well as their initial values (namely at time = 0, the starting point of the modelled epidemic, or of the computational prediction). Since q(t) is time dependent, this necessitates having a time-dependent reproduction number R_t. To compute R_t, we follow the next-generation matrix method [Reference van den Driessche56].

Let x = (x ₁,⋅⋅⋅,x_n) be the number of individuals in each compartment, where m < n compartments contain infected individuals – here the U and I compartments (i.e. m = 2). Consider the model equations written in the form ${\rm d}x_i/{\rm d}t = {\cal F}_i\lpar x\rpar -{\cal V}_i\lpar x\rpar$ for i = 1,2,…,m. Here ${\cal F}$_i(x) is the rate of appearance of new infections in compartment i (i.e. positive terms), and ${\cal V}_{\rm i}\lpar x\rpar$ is the rate of transitions between compartment i and other infected compartments (i.e. negative terms). It is assumed that ${\cal F}$_i = 0 if i ∈ (m + 1,n); i.e. it is zero for compartments that do not describe infected populations. Define matrices $F = \lsqb {\partial {\cal F}_i\lpar {x\lpar 0 \rpar } \rpar /\partial x_j} \rsqb$ and $V = \lsqb {\partial {\cal V}_i\lpar {x\lpar 0 \rpar } \rpar /\partial x_j} \rsqb$ for 1 ≤ i, j ≤ m. Let ψ(0) be the number of infected and undetected infected at the initial time of detection. Then FV ⁻¹ψ(0) gives the expected number of new infections; i.e. the matrix FV ⁻¹ has the (i,j) item equal to the expected number of secondary infections in compartment i produced by an infected individual introduced in compartment j. Then R₀ is given by the largest positive eigenvalue of FV ⁻¹. For our model,

(9)$$\matrix{ {F = \left[{\matrix{ {\beta aS_0U_0^{a-1} } & 0 \cr \delta & 0 \cr } } \right]} \cr } $$

(10)$$\matrix{ {V = \left[{\matrix{ {\delta + \varepsilon + q\lpar 0\rpar } & 0 \cr 0 & {\alpha + \gamma } \cr } } \right]} \cr } $$

which, through taking the product of F and the inverse of V, yields

(11)$$\matrix{ {R_0 = \displaystyle{{\beta aS_0U_0^{a-1} } \over {\delta + \varepsilon + q\lpar 0\rpar }}} \cr } $$

Note that F and V are 2 × 2 matrixes because we have only two compartments of infected populations – Infected (I) and Undetected infected (U). At the beginning of the outbreak (i.e. t = 0) q(0) = 0, S ₀ = S(0) ≈1 and in equation 10 U ₀ = U(0) is a fit parameter. Given this form, we can trivially extend equation 10 for later times by including the time dependence of the S, U and q parameters (so that we can get R_t for all times) as

(12)$$\matrix{ {R_{\rm t} = \displaystyle{{\beta aS\lpar t\rpar U{\lpar t\rpar }^{a-1}} \over {\delta + \varepsilon + q\lpar t\rpar }}} \cr } $$

We show R ₀ and R_t values on 26 April 2020 – during the sequestration period in many states – in Table 2. Note that while many states face endemic Covid-19 infections (in Fig. 4) despite surprisingly having R_t < 1; this is due to our R_t not taking into account reintroduction of recovered people into the Susceptible compartment, which increases the pool of individuals available for infection.

Acknowledgements

This study was supported by TTU President's Distinguished Chair Funds. We acknowledge early encouragement by Dr Jamilur R. Choudhury to study Covid-19, and humbly dedicate this paper to his memory.

Conflict of interest

None.

Data availability statement

Data for cumulative confirmed cases and deaths were obtained from the Johns Hopkins University (JHU) Center for Systems Science and Engineering, posted on the GitHub website [34].

References

Schumaker, E (2020) Timeline: How coronavirus got started. ABC News 2020; 28 July.Google Scholar

Gonzalez-Reiche, A et al. (2020) Introductions and early spread of SARS-CoV-2 in the New York City area. medRxiv. doi: https://doi.org/10.1101/2020.04.08.20056929.Google Scholar PubMed

World Health Organization (2020) Coronavirus disease 2019 (COVID-19): situation report 161. WHO Technical Report 2020; 29 June.Google Scholar

Emanuel, E et al. (2020) Fair allocation of scarce medical resources in the time of Covid-19. New England Journal of Medicine 382, 2049–2055.CrossRef Google Scholar PubMed

Anderson, R et al. (2020) How will country-based mitigation measures influence the course of the COVID-19 epidemic? The Lancet 395, 931–934.Google Scholar PubMed

Brauer, F, Castillo-Chavez, C and Feng, Z (2019) Simple compartmental models for disease transmission. In Mathematical Models in Epidemiology. New York, N.Y.: Springer, pp. 21–61.Google Scholar

Small, M, Shi, P and Tse, CK (2004) Plausible models for propagation of the SARS virus. IEICE Transactions on Fundamentals of Electronics. Communications and Computer Sciences 87, 2379–2386.Google Scholar

Chang, HJ (2017) Estimation of basic reproduction number of the Middle East respiratory syndrome coronavirus (MERS-CoV) during the outbreak in South Korea, 2015. Biomedical Engineering Online 16, 79.Google Scholar PubMed

Lee, JC et al. (2020) See how all 50 states are reopening (and closing again). The New York Times 2020; 24 April.Google Scholar

Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (2020) Coronavirus COVID global cases. Coronavirus Resource Center 2020; April 17:19.Google Scholar

Parmet, WE and Sinha, MS (2020) Covid-19? The law and limits of quarantine. New England Journal of Medicine 382, e28.CrossRef Google Scholar PubMed

Liu, X and Stechlinski, P (2012) Infectious disease models with time-varying parameters and general nonlinear incidence rate. Applied Mathematical Modelling 36, 1974–1994.Google Scholar

Novozhilov, AS (2012) Epidemiological models with parametric heterogeneity: deterministic theory for closed populations. Mathematical Modelling of Natural Phenomena 7, 147–167.CrossRef Google Scholar

Roy, M and Pascual, M (2006) On representing network heterogeneities in the incidence rate of simple epidemic models. Ecological Complexity 3, 80–90.CrossRef Google Scholar

Xu, C et al. (2020) Forecast analysis of the epidemics trend of COVID-19 in the United States by a generalized fractional-order SEIR model. arXiv preprint. arXiv:2004.12541.Google Scholar

Moghadas, SM et al. (2020) Projecting hospital utilization during the COVID-19 outbreaks in the United States. Proceedings of the National Academy of Sciences 117, 9122–9126.Google Scholar PubMed

Maier, BF and Brockmann, D (2020) Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. Science (New York, N.Y.) 368, 742–746.CrossRef Google Scholar PubMed

Gostin, LO, Hodge, JG and Wiley, LF (2020) Presidential powers and response to COVID-19. Journal of the American Medical Association 323, 1547–1548.Google Scholar PubMed

Mervosh, S, Lu, D and Swales, V (2020) See which states and cities have told residents to stay at home. New York Times 2020; 23 March.Google Scholar

World Health Organization. (2020) “Immunity passports” in the context of COVID-19: scientific brief. WHO Technical Report 2020; 24 April.CrossRef Google Scholar

Long, QX et al. (2020) Clinical and immunological assessment of asymptomatic SARS-COV-2 infections. Nature Medicine 26, 1200–1204.CrossRef Google Scholar PubMed

Centers for Disease Control and Prevention (2020) Evaluating and testing persons for coronavirus disease 2019 (COVID-19). National Center for Immunization and Respiratory Diseases (NCIRD), Division of Viral Diseases 2020; 24 March.Google Scholar

Callow, KA et al. (1990) The time course of the immune response to experimental coronavirus infection of man. Epidemiology & Infection 105, 435–446.Google Scholar

Centers for Disease Control and Prevention (2019) 2014–2016 Ebola outbreak in West Africa. Centers for Disease Control and Prevention, National Center for Emerging and Zoonotic Infectious Diseases (NCEZID), Division of High-Consequence Pathogens and Pathology (DHCPP), Viral Special Pathogens Branch (VSPB). Last reviewed March 8, 2019. Note: majority of cases in U.S. were healthcare workers; see section “Ebola in the United States”.Google Scholar

Ng, K et al. (2020) COVID-19 and the risk to health care workers: a case report. Annals of Internal Medicine 172, 766–767.Google Scholar PubMed

Kennedy, DM et al. (2020) Modeling the effects of intervention strategies on COVID-19 transmission dynamics. Journal of Clinical Virology 128, 104440.Google Scholar PubMed

Li, ML et al. (2020) Forecasting COVID-19 and analyzing the effect of government interventions. medRxiv. 2020.06.23.20138693.Google Scholar

Basu, A (2020) Estimating the infection fatality rate among symptomatic COVID-19 cases in the United States: study estimates the COVID-19 infection fatality rate at the US county level. Health Affairs 39, 1229–1236.Google Scholar

Weinberger, D et al. (2020) Estimating the early death toll of COVID-19 in the United States. medRxiv. 2020.04.15.20066431.Google Scholar PubMed

Cain, DW and Cidlowski, JA (2020) After 62 years of regulating immunity, dexamethasone meets COVID-19. Nature Reviews Immunology. doi: https://doi.org/10.1038/s41577-020-00421-x.Google Scholar PubMed

Bump, P (2020) Fauci puts it bluntly: Coronavirus deaths are undercounted. The Washington Post 2020; 12 May.Google Scholar

Rothfield, M et al. (2020) 13 deaths in a day: An ‘Apocalyptic’ coronavirus surge at an N.Y.C. hospital. New York Times 2020; 25 March.Google Scholar

Duffy, C (2020) Almost 65,000 COVID cases in NJ, 7 hospitals hit capacity. pix11.com 2020; 13 April.Google Scholar

Johns Hopkins University (2020) Available at https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series.Google Scholar

Liu, Y et al. (2020) The reproductive number of COVID-19 is higher compared to SARS coronavirus. Journal of Travel Medicine 27, taaa021.Google Scholar PubMed

Jones, J. (2020) Spanish antibody study points to 5% of population affected by coronavirus. Reuters 2020; 13 May. Note: study was conducted by the Carlos III Institute for Health and the Spanish National Statistics Institute.Google Scholar

Gaeta, G (2020) A simple SIR model with a large set of asymptomatic infectives. arXiv preprint. arXiv:2003.08720v1.Google Scholar

Roser, M et al. (2020) Mortality risk of COVID-19. Our World in Data 2020. Available at https://ourworldindata.org/mortality-risk-covid.Google Scholar

Mervosh, S, Lee, JC and Popovich, N (2020) See which states are reopening and which are still shut down. The New York Times 2020; April.Google Scholar

Bartels, L and Achen, C (2016) Democracy for Realists. Princeton: Princeton University Press.Google Scholar

Debenedetto, P and Watkins, K. (2020) Harris county orders public to stay indoors amid coronavirus pandemic. Houston Public Media 2020; 24 March.Google Scholar

Pinotti, F et al. (2020) Lessons learnt from 288 COVID-19 international cases: importations over time, effect of interventions, underdetection of imported cases. medRxiv. 2020.02.24.20027326.Google Scholar

Chinazzi, M et al. (2020) The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science (New York, N.Y.) 368, 395–400.Google Scholar PubMed

Hethcote, HW, Stech, HW and van den Driessche, P (1981) Nonlinear oscillations in epidemic models. SIAM Journal on Applied Mathematics 40, 1–9.Google Scholar

Peirlinck, M et al. (2020) Outbreak dynamics of COVID-19 in China and the United States. Biomechanics and Modeling in Mechanobiology.Google Scholar PubMed

Kriston, L. (2020) Projection of cumulative coronavirus disease 2019 (COVID-19) case growth with a hierarchical logistic model. Bulletin of the World Health Organization COVID-19 Open Preprints 2020. doi: http://dx.doi.org/10.2471/BLT.20.257386.Google Scholar

Uhlig, S et al. (2020) Modeling projections for COVID-19 pandemic by combining epidemiological, statistical, and neural network approaches. medRxiv. 2020.04.17.20059535.Google Scholar

Aboelkassem, Y (2020) COVID-19 pandemic: a Hill type mathematical model predicts the US death number and the reopening date. medRxiv. 2020.04.12.20062893.Google Scholar

Al-Raeei, M (2020) The forecasting of COVID-19 with mortality using SIRD epidemic model for the United States, Russia, China, and the Syrian Arab Republic. AIP Advances 10, 065325.Google Scholar

Zou, D et al. (2020) Epidemic model guided machine learning for COVID-19 forecasts in the United States. medRxiv. 2020.05.24.20111989.Google Scholar

IHME COVID-19 Forecasting Team, Hay SI (2020) COVID-19 scenarios for the United States. medRxiv. 2020.07.12.20151191.Google Scholar

Kissler, SM et al. (2020) Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period. Science (New York, N.Y.) 368, 860–868.Google Scholar PubMed

Kyrychko, YN and Blyuss, KB (2005) Global properties of a delayed SIR model with temporary immunity and nonlinear incidence rate. Nonlinear Analysis: Real World Applications 6, 495–507.Google Scholar

United States Census Bureau (2020) Available at https://www2.census.gov/programs-surveys/popest/datasets/2010-2019/national/totals.Google Scholar

The MathWorks, Inc. (2017) Optimization Toolbox User's Guide: Least-Squares (Model Fitting) Algorithms, r2017a edition, March 2017.Google Scholar

van den Driessche, P (2017) Reproduction numbers of infectious disease models. Infectious Disease Modelling 2, 288–303.Google Scholar PubMed

Fig. 1. Schematic of the compartments, with the rates of transfer between the compartments.

Fig. 2. SQUIDER model fits. Fits of our compartment model to recorded data on confirmed cumulative case counts and deaths; all fits have R2 ≥ 0.996. Data were obtained from The Johns Hopkins University [34]. The vertical dotted line indicates the last date fitting data was obtained for.

Table 1. SQUIDER model fit parameters for selected US states

Table 2. Basic R0 and effective Rt reproduction number values for selected US states

Fig. 6. Computed results for all the compartments (example: New York State). Demonstrating the effect of the ODE model's q pulses timed for 2 April and 29 May on the S (blue) and Q (green) population dynamics. qi values ≈20% and −25%, respectively. Due to the y-axis scale some compartments are partially occluded.

Fig. 7. Applicability of basic SIR model to COVID dynamics (example: New York State). (a) Dynamics of underlying SIR model in SQUIDER fit of NY data (i.e. using only β and values plus initial condition U0 in the ODE simulation). (b) Attempted independent fit of basic SIR model to empirical data (cumulative cases = combined infected and recovered).

Article contents

A predictive model for Covid-19 spread – with application to eight US states and how to end the pandemic

Abstract

Keywords

Introduction

A new model

Assumptions

Results

Coefficient evaluation

Model predictions

Sensitivity to intervention level

Discussion

Appendices

Methods

The basic SIR model and SQUIDER

The effective reproduction number Rt

Acknowledgements

Conflict of interest

Data availability statement

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests

The effective reproduction number R_t