COVID-19 short-term forecasts Confirmed 2020-06-29 Latin American Countries


Disclaimer

  • Forecasts produced by Jennie Castle, Jurgen Doornik, and David Hendry, researchers at the University of Oxford. These are our forecasts, and should not be considered official forecasts from, or endorsed by, any of: University of Oxford, Oxford Martin School, Nuffield College, or Magdalen College.
  • These forecasts are short term time-series extrapolations of the data. They are not based on epidemiological modelling or simulations. The documentation that is provided is still in progress and not peer reviewed. All forecasts are uncertain: their success can only be determined afterwards. Many mitigation strategies are in place, which, if successful, invalidate our forecasts. An explanation of our methods is provided below.

Recent changes

[2020-03-24] Our forecasts are starting to overestimate in some cases. This was always expected to happen when the increase starts to slow down. Scenario forecasts that are based on what happened in China earlier this year, but only for Italy and Spain sofar.
[2020-03-26] Scenario forecasts that are based on what happened in China earlier this year, only for Italy.
[2020-03-31] Scenario forecasts, based on what happened in China earlier this year, are presented for several countries (line marked with x). Created more plausible 90% confidence bands (dotted line in same colour).
[2020-04-02] Now including more US States, based on New York Times data.
[2020-04-06] Added a post hoc estimate of the peak number of cases. This needs at least three confirmed observations (four for deaths) after the event. It is based on the averaged smooth trend, and can change later or be a local peak. It is marked with a vertical line with the date label, or a date with left arrow in the bottom left corner of the graph. This is backported to 2020-04-04.
[2020-04-08] Minor correction to peak estimates. Added table with scenario forecasts.
[2020-04-09] Added table with estimated peak dates (if happened) and dates to and since the peak. Note that this can be a local peak, and subsequent re-acceleration (or data revisions) can result in a new peak later.
[2020-04-10] Updated documentation with better description of short-term estimates and peak determination.
[2020-04-16] Added scenario forecasts to all graphs now. This would now be the preferred forecast for most.
This is the first time with a peak in confirmed UK cases (also for deaths, but this is uncertain because it is at the same date).
[2020-04-17] Bird and Nielsen look into nowcasting death counts in England.
[2020-04-24] A summary of our work on short-term COVID-19 forecasting appeared as a voxeu.
[2020-04-27] Our short-term COVID-19 forecasting paper is now available as Nuffield Economics Discussion Paper 2020-W06.
A small adjustment has been made to the scenario forecast methodology, and will be documented shortly.
[2020-04-29] See our blog entry at the International Institute of Forecasters.
US history of death counts revised in Johns Hopkins/CSSE data.
UK death counts have been revised to include the deaths in care homes. In the Johns Hopkins/CSSE data set, which we use, the entire history has been revised. So forecasts made up to 2020-04-29 cannot be compared to later outcomes. In the ECDC data set only the last observation has changed, causing a jump in the series.
[2020-05-06] The New York Times is in the process of redefining its US state data. Unfortunately, at the moment only the last observation has changed (e.g New York deaths jumped from 19645 on 2020-05-05 to 25956 a day later). This means the data is currently useless; however it does bring it close to the Johns Hopkins/CSSE count (25626 on 2020-05-06). The aggregate US count is based on JH/CSSE so unaffected. We now use Johns Hopkins/CSSE US state data, including all states with sufficient counts. So the new forecasts cannot be compared to those previously.
A minor change is that we show the graph without scenario forecast if no peak has been detected yet.
[2020-05-13] We now omit countries with fewer than 200 confirmed cases in the last week (25 for deaths).
The short-term paper has some small updates, including further comparisons with other models.
Data for Ecuador are not reliable enough for forecasting.
Switched to an improved version of scenario forecasting.
[2020-05-18] Minor fixes to the improved version of scenario forecasting, backported to 2020-05-13.
[2020-05-20] Problem with UK confirmed cases: negative daily count. This makes the forecasts temporarily unreliable.
Updated the second paper.
[2020-06-04] Data issues with confirmed cases for France.
Added an appendix to the short term paper with further forecast comparisons for European and Latin American countries.
Both Sweden and Iran have lost their peak in confirmed cases. For Sweden the previous peak was on 24 April (daily peak of 656 cases), for Iran it was on 31 March (peak of 3116). For Iran this looks like a second wave, with increasing daily counts for the last four weeks. For Sweden this is a sudden jump in confirmed cases in the last two days, compared to a fairly steady weekly pattern over the previous six weeks.
[2020-06-06] Removed Brazil from yesterday's forecasts (only; last observation 2020-06-05).
[2020-06-24] Research presentation on short-term COVID-19 forecasting on 26 June (14:00 UK time) at the Quarterly Forecasting Forum of the IIF UK Chapter.
[2020-06-29] Tables in April included the world, but not the world as we know it (double counting China and the US). So removed the world from those old tables.
Why short-term forecasts can be better than models for predicting how pandemics evolve just appeared at The Conversation.
Thursday 2 July webinar at the FGV EESP - São Paolo School of Economics. This starts at 16:00 UK time (UTC+01:00) and will be streamed here.

Further information

  • We believe these forecasts fill a useful gap in the short run. They give an indication of what is likely to happen in the next few days, removing some aspect of surprise. Moreover, a noticeable drop in comparison to the extrapolations could be an indication that the implemented policies are having some impact. It is difficult to understand exponential growth. We hope that these forecasts may help to convince viewers to adhere to the policies implemented by their respective governments, and keep all arguments factual and measured.
  • We use the data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering. This is updated daily, but we tend to update our forecasts only every other day.
    US state data as of 2020-03-28 is courtesy of the New York Times.
  • We can only provide forecasts of what is measured. If confirmed cases are an underestimate of actual cases, then our forecasts will also be underestimates. No other epidemiological data is used. Data definition and collection differs between countries and may change over time.
  • We will update the methodology as we learn what is happening in the next few days or weeks. Once the number of cases levels off, there is no need to provide these forecasts anymore.
  • Countries where the counts are very low or stable have been omitted.
  • The graphs have dates on the horizontal axis (yyyy-mm-dd) and cumulative counts on the vertical axis. They show
    1. bold dark grey line (with circles): observed counts (Johns Hopkins CSSE);
    2. many light grey lines (with open circles): forecasts using different model settings and starting up to four periods back;
    3. red line (with open circles): single forecasts path using default model settings;
    4. black line (with crosses): average of all forecasts, recentered on the last observation;
    5. thin green lines: some indication of uncertainty around the red forecasts, but we do not know how reliable that is.
    Both the red line forecasts and the black lines are also given in the tables above. These forecasts differ, we are currently inclined to use the average forecasts.
  • The forecasts are constructed as follows:
    1. An overall `trend' is extracted by taking a window of the data at a time. In each window we draw `straight lines' which are selected using an automatic econometric procedure (`machine learning'). All straight lines are collected and averaged, giving the trend.
    2. Forecasts are made using the estimated trend, but we note that this must be done carefully, because simply extrapolating the flexible insample trend would lead to wildly fluctuating forecast. We use the `Cardt' method, which has been found to work well in other settings.
    3. Residuals from the trend are also forecast, and combined with trend forecasts into an overall forecast.
  • Scenario forecasts are constructed very differently: smooth versions of the Chinese experience are matched at different lag lengths with the path of each country. This probably works best from the peak, or the slowdown just before (but we include it for the UK nonetheless).
  • The forecast evaluation shows past forecasts, together with the outcomes (in the grey line with circles).
  • EU-BS is Estonia, Latvia, and Lithuania together.
  • This paper describes the methodology and gives further references. Also available as Nuffield Economics Discussion Paper 2020-W06. Still preliminary is the documentation of the medium term forecasts.

Confirmed count average forecast Latin America (bold black line in graphs) 2020-06-30 to 2020-07-06

DateArgentinaBoliviaBrazilChileColombiaCosta RicaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaParaguayPeruVenezuela
2020-06-29 62268 32125 1368195 275999 91995 3269 31816 55665 6173 17409 5933 18818 220657 32785 2191 282365 5530
2020-06-30 64700 33000 1400000 278000 96000 3420 32600 56200 6390 17800 5990 19600 224000 33800 2280 286000 5720
2020-07-01 67300 34000 1436000 281000 100000 3580 33600 56800 6630 18200 6060 20600 229000 35000 2390 289000 5960
2020-07-02 69900 35000 1472000 284000 105000 3740 34600 57500 6880 18700 6130 21600 233000 36200 2520 292000 6220
2020-07-03 72700 36000 1510000 287000 110000 3910 35700 58100 7140 19200 6210 22600 238000 37400 2650 295000 6480
2020-07-04 75600 37000 1548000 290000 115000 4090 36800 58700 7400 19700 6300 23700 243000 38700 2810 299000 6760
2020-07-05 78700 38100 1588000 294000 120000 4280 38000 59400 7680 20300 6380 24900 248000 40000 2970 302000 7040
2020-07-06 81900 39300 1628000 297000 126000 4480 39100 60000 7970 20800 6470 26100 253000 41400 3140 305000 7340

Confirmed count forecast Latin America (bold red line in graphs) 2020-06-30 to 2020-07-06

DateArgentinaBoliviaBrazilChileColombiaCosta RicaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaParaguayPeruVenezuela
2020-06-29 62268 32125 1368195 275999 91995 3269 31816 55665 6173 17409 5933 18818 220657 32785 2191 282365 5530
2020-06-30 64600 33000 1400000 279000 96000 3390 32600 56300 6400 17800 6000 20000 223000 33800 2200 286000 5710
2020-07-01 66900 33800 1432000 283000 100000 3510 33400 56900 6650 18200 6090 21100 226000 34900 2200 289000 5920
2020-07-02 69300 34600 1464000 286000 104000 3640 34300 57500 6900 18700 6170 22300 229000 36000 2210 292000 6130
2020-07-03 71800 35400 1497000 289000 109000 3780 35200 58100 7160 19100 6260 23500 232000 37200 2210 295000 6350
2020-07-04 74300 36300 1530000 292000 113000 3920 36100 58700 7430 19600 6340 24700 235000 38400 2220 298000 6580
2020-07-05 77000 37200 1564000 296000 117000 4060 37000 59300 7710 20100 6430 26100 239000 39700 2220 301000 6820
2020-07-06 79700 38100 1599000 299000 122000 4210 38000 59900 8000 20600 6520 27500 242000 41000 2220 305000 7070

Confirmed count scenario forecast (bold purple line in graphs) 2020-06-30 to 2020-07-08

DateArgentinaBoliviaBrazilChileColombiaCosta RicaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoPanamaParaguayPeruVenezuela
2020-06-29 62268 32125 1368195 275999 91995 3269 31816 55665 6173 17409 5933 18818 220657 32785 2191 282365 5530
2020-06-30 64600 33100 1403000 279000 97000 3420 32500 56200 6340 17700 5950 19600 225000 33500 2360 285000 5740
2020-07-01 66300 33900 1436000 282000 99000 3520 33100 56800 6520 18100 6010 20400 228000 34300 2490 288000 5940
2020-07-02 68400 34600 1464000 287000 102000 3640 33600 57200 6670 18400 6080 21100 232000 35000 2630 290000 6120
2020-07-03 70000 35300 1486000 290000 104000 3710 34100 57700 6830 18800 6130 21800 235000 35700 2720 293000 6250
2020-07-04 72000 35900 1506000 291000 106000 3800 34500 58000 6940 19000 6170 22400 239000 36200 2920 295000 6390
2020-07-05 74000 36400 1524000 292000 108000 3860 34800 58200 7040 19300 6210 22900 243000 36600 3040 296000 6500
2020-07-06 75800 37100 1543000 293000 110000 3940 35300 58300 7170 19600 6240 23500 244000 37100 3080 298000 6610
2020-07-07 77600 37500 1565000 294000 111000 4020 35700 58300 7270 19900 6270 24000 247000 37700 3240 299000 6770
2020-07-08 78600 38100 1589000 294000 113000 4080 36100 58500 7360 20100 6300 24500 250000 38200 3330 300000 6890

Peak increase in estimated trend of Confirmed in Latin America 2020-06-29

ArgentinaBoliviaBrazilChileColombiaCosta RicaDominican RepublicEcuadorEl SalvadorGuatemalaHaitiHondurasMexicoNicaraguaPanamaParaguayPeruVenezuela
Peak date --06-2406-2606-17 -- -- --04-24 --06-2306-07 --06-2505-26 -- --05-31 --
Peak daily increment 1042 37170 16274 3836 552 205 5581 119 6759
Days from 100 to peak 85 104 93 37 75 33 99 7 75
Days from peak/2 to peak 66 77 63 26 64 33 78 7 53
Last total 62268 32125 1368195 275999 91995 3269 31816 55665 6173 17409 5933 18818 220657 2170 32785 2191 282365 5530
Last daily increment 2335 601 24052 4017 0 139 443 410 239 479 156 736 3805 0 1099 64 2946 233
Last week 15065 5736 222289 25232 18235 901 3880 4022 1200 2869 609 4875 29247 0 5471 769 21555 1343
Days since peak 5 3 12 66 6 22 4 34 29

Initial visual evaluation of forecasts of Confirmed