COVID-19 short-term forecasts Confirmed 2020-06-01


Disclaimer

  • Forecasts produced by Jennie Castle, Jurgen Doornik, and David Hendry, researchers at the University of Oxford. These are our forecasts, and should not be considered official forecasts from, or endorsed by, any of: University of Oxford, Oxford Martin School, Nuffield College, or Magdalen College.
  • These forecasts are short term time-series extrapolations of the data. They are not based on epidemiological modelling or simulations. The documentation that is provided is still in progress and not peer reviewed. All forecasts are uncertain: their success can only be determined afterwards. Many mitigation strategies are in place, which, if successful, invalidate our forecasts. An explanation of our methods is provided below.

Recent changes

[2020-03-24] Our forecasts are starting to overestimate in some cases. This was always expected to happen when the increase starts to slow down. Scenario forecasts that are based on what happened in China earlier this year, but only for Italy and Spain sofar.
[2020-03-26] Scenario forecasts that are based on what happened in China earlier this year, only for Italy.
[2020-03-31] Scenario forecasts, based on what happened in China earlier this year, are presented for several countries (line marked with x). Created more plausible 90% confidence bands (dotted line in same colour).
[2020-04-02] Now including more US States, based on New York Times data. And the world.
[2020-04-06] Added a post hoc estimate of the peak number of cases. This needs at least three confirmed observations (four for deaths) after the event. It is based on the averaged smooth trend, and can change later or be a local peak. It is marked with a vertical line with the date label, or a date with left arrow in the bottom left corner of the graph. This is backported to 2020-04-04.
[2020-04-08] Minor correction to peak estimates. Added table with scenario forecasts.
[2020-04-09] Added table with estimated peak dates (if happened) and dates to and since the peak. Note that this can be a local peak, and subsequent re-acceleration (or data revisions) can result in a new peak later.
[2020-04-10] Updated documentation with better description of short-term estimates and peak determination.
[2020-04-16] Added scenario forecasts to all graphs now. This would now be the preferred forecast for most.
This is the first time with a peak in confirmed UK cases (also for deaths, but this is uncertain because it is at the same date).
[2020-04-17] Bird and Nielsen look into nowcasting death counts in England.
[2020-04-24] A summary of our work on short-term COVID-19 forecasting appeared as a voxeu.
[2020-04-27] Our short-term COVID-19 forecasting paper is now available as Nuffield Economics Discussion Paper 2020-W06.
A small adjustment has been made to the scenario forecast methodology, and will be documented shortly.
[2020-04-29] See our blog entry at the International Institute of Forecasters.
US history of death counts revised in Johns Hopkins/CSSE data.
UK death counts have been revised to include the deaths in care homes. In the Johns Hopkins/CSSE data set, which we use, the entire history has been revised. So forecasts made up to 2020-04-29 cannot be compared to later outcomes. In the ECDC data set only the last observation has changed, causing a jump in the series.
[2020-05-06] The New York Times is in the process of redefining its US state data. Unfortunately, at the moment only the last observation has changed (e.g New York deaths jumped from 19645 on 2020-05-05 to 25956 a day later). This means the data is currently useless; however it does bring it close to the Johns Hopkins/CSSE count (25626 on 2020-05-06). The aggregate US count is based on JH/CSSE so unaffected. We now use Johns Hopkins/CSSE US state data, including all states with sufficient counts. So the new forecasts cannot be compared to those previously.
A minor change is that we show the graph without scenario forecast if no peak has been detected yet.
[2020-05-13] We now omit countries with fewer than 200 confirmed cases in the last week (25 for deaths).
The short-term paper has some small updates, including further comparisons with other models.
Data for Ecuador are not reliable enough for forecasting.
Switched to an improved version of scenario forecasting.
[2020-05-18] Minor fixes to the improved version of scenario forecasting, backported to 2020-05-13.
[2020-05-20] Problem with UK confirmed cases: negative daily count. This makes the forecasts temporarily unreliable.
Updated the second paper.

Further information

  • We believe these forecasts fill a useful gap in the short run. They give an indication of what is likely to happen in the next few days, removing some aspect of surprise. Moreover, a noticeable drop in comparison to the extrapolations could be an indication that the implemented policies are having some impact. It is difficult to understand exponential growth. We hope that these forecasts may help to convince viewers to adhere to the policies implemented by their respective governments, and keep all arguments factual and measured.
  • We use the data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering. This is updated daily, but we tend to update our forecasts only every other day.
    US state data as of 2020-03-28 is courtesy of the New York Times.
  • We can only provide forecasts of what is measured. If confirmed cases are an underestimate of actual cases, then our forecasts will also be underestimates. No other epidemiological data is used. Data definition and collection differs between countries and may change over time.
  • We will update the methodology as we learn what is happening in the next few days or weeks. Once the number of cases levels off, there is no need to provide these forecasts anymore.
  • Countries where the counts are very low or stable have been omitted.
  • The graphs have dates on the horizontal axis (yyyy-mm-dd) and cumulative counts on the vertical axis. They show
    1. bold dark grey line (with circles): observed counts (Johns Hopkins CSSE);
    2. many light grey lines (with open circles): forecasts using different model settings and starting up to four periods back;
    3. red line (with open circles): single forecasts path using default model settings;
    4. black line (with crosses): average of all forecasts, recentered on the last observation;
    5. thin green lines: some indication of uncertainty around the red forecasts, but we do not know how reliable that is.
    Both the red line forecasts and the black lines are also given in the tables above. These forecasts differ, we are currently inclined to use the average forecasts.
  • The forecasts are constructed as follows:
    1. An overall `trend' is extracted by taking a window of the data at a time. In each window we draw `straight lines' which are selected using an automatic econometric procedure (`machine learning'). All straight lines are collected and averaged, giving the trend.
    2. Forecasts are made using the estimated trend, but we note that this must be done carefully, because simply extrapolating the flexible insample trend would lead to wildly fluctuating forecast. We use the `Cardt' method, which has been found to work well in other settings.
    3. Residuals from the trend are also forecast, and combined with trend forecasts into an overall forecast.
  • Scenario forecasts are constructed very differently: smooth versions of the Chinese experience are matched at different lag lengths with the path of each country. This probably works best from the peak, or the slowdown just before (but we include it for the UK nonetheless).
  • The forecast evaluation shows past forecasts, together with the outcomes (in the grey line with circles).
  • EU-BS is Estonia, Latvia, and Lithuania together.
  • This paper describes the methodology and gives further references. Also available as Nuffield Economics Discussion Paper 2020-W06. Still preliminary is the documentation of the medium term forecasts.

Confirmed count average forecast Latin America (bold black line in graphs) 2020-06-02 to 2020-06-08

DateArgentinaBoliviaBrazilChileColombiaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaPeru
2020-06-01 17415 10531 526447 105158 29384 17572 39098 2582 5336 2226 5362 93435 13837 170039
2020-06-02 18100 11100 544000 110000 30100 17900 39300 2660 5430 2380 5460 96000 14300 179000
2020-06-03 18700 11800 566000 116000 31400 18200 39500 2760 5580 2590 5590 99000 14900 188000
2020-06-04 19500 12500 589000 122000 32800 18600 39700 2870 5780 2830 5740 102000 15400 198000
2020-06-05 20200 13300 613000 128000 34300 18900 40000 2990 5970 3100 5900 106000 16000 209000
2020-06-06 21000 14100 637000 135000 35800 19300 40200 3110 6180 3390 6060 109000 16700 220000
2020-06-07 21800 15000 664000 142000 37500 19700 40400 3230 6390 3710 6230 112000 17300 232000
2020-06-08 22600 15900 691000 149000 39100 20000 40700 3360 6610 4060 6400 116000 18000 244000

Confirmed count forecast Latin America (bold red line in graphs) 2020-06-02 to 2020-06-08

DateArgentinaBoliviaBrazilChileColombiaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaPeru
2020-06-01 17415 10531 526447 105158 29384 17572 39098 2582 5336 2226 5362 93435 13837 170039
2020-06-02 18000 11100 539000 111000 30500 17900 39300 2670 5540 2410 5460 96000 14300 180000
2020-06-03 18600 11700 553000 117000 32000 18200 39500 2760 5740 2580 5580 99000 14800 190000
2020-06-04 19300 12300 567000 124000 33600 18500 39700 2850 5950 2790 5700 101000 15200 200000
2020-06-05 19900 12900 581000 131000 35200 18900 39800 2950 6160 3000 5810 104000 15700 211000
2020-06-06 20600 13500 595000 138000 36900 19200 40000 3050 6380 3220 5930 107000 16300 223000
2020-06-07 21300 14200 610000 146000 38700 19500 40200 3150 6610 3460 6050 110000 16800 235000
2020-06-08 22000 14900 626000 154000 40600 19900 40300 3260 6850 3720 6160 113000 17300 247000

Confirmed count scenario forecast (bold purple line in graphs) 2020-06-02 to 2020-06-10

DateArgentinaBoliviaBrazilChileColombiaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaPeru
2020-06-01 17415 10531 526447 105158 29384 17572 39098 2582 5336 2226 5362 93435 13837 170039
2020-06-02 18100 11000 548000 109000 29900 17900 39300 2670 5470 2530 5500 96000 14400 176000
2020-06-03 18800 11500 568000 112000 31000 18200 39400 2750 5660 2830 5650 98000 14700 181000
2020-06-04 19600 12000 590000 117000 32000 18500 39500 2850 5890 3170 5830 102000 15200 185000
2020-06-05 20200 12300 602000 121000 33000 18800 39600 2940 6090 3420 6010 104000 15500 188000
2020-06-06 20900 12800 618000 125000 33900 19000 39700 3020 6360 3720 6180 106000 16400 192000
2020-06-07 21600 13200 632000 129000 35200 19200 39800 3090 6580 4010 6450 108000 16600 192000
2020-06-08 22200 13600 641000 133000 36000 19400 39900 3150 6810 4180 6680 109000 16800 194000
2020-06-09 22700 14000 652000 136000 37200 19600 40000 3210 7020 4470 6910 111000 16900 196000
2020-06-10 23300 14400 664000 140000 38100 19900 40000 3260 7190 4560 7150 113000 17600 199000

Peak increase in estimated trend of Confirmed in Latin America 2020-06-01

ArgentinaBoliviaBrazilChileColombiaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaPeru
Peak date05-28 -- -- -- -- --04-24 --05-24 --05-2405-27 -- --
Peak daily increment 709 3949 296 250 3241
Days from 100 to peak 70 37 45 56 70
Days from peak/2 to peak 65 25 41 55 54
Last total 17415 10531 526447 105158 29384 17572 39098 2582 5336 2226 5362 93435 13837 170039
Last daily increment 564 549 11598 5470 2165 287 0 65 249 102 160 2771 374 5563
Last week 4187 3395 135225 27197 6381 2308 1743 540 1382 1052 961 18875 2390 40288
Days since peak 4 38 8 8 5

Initial visual evaluation of forecasts of Confirmed