This project delivers a time series analysis and short-term forecasting of daily UK Covid-19 case counts using ARIMA models, based on data from January 1, 2020 to June 14, 2020. All analysis was conducted in IBM SPSS, aiming to assess stationarity, build a parsimonious ARIMA model, and forecast cases for the period 15–21 June 2020.
- Source: UK government daily Covid-19 cases data.
- Period: 1 January 2020 – 14 June 2020
- Frequency: Daily (time series, no missing days)
- Variables: Date, Number of new cases
- Forecast window: 15–21 June 2020 (out-of-sample)
- File Provided:
Covid19_uk.xlsx
- Data inspection and cleaning
- Check for stationarity (ACF & PACF of original series)
- Difference the series and confirm stationarity
- Model identification (ARIMA parameter selection)
- Model estimation and diagnostics
- Forecasting for the next 7 days
- Model evaluation (MAE, MSE, residual analysis)
The ACF of the original series shows strong, slowly decaying autocorrelation, indicating non-stationarity. The PACF has a strong lag-1 spike, typical for AR processes.
After first-order differencing, the ACF cuts off rapidly and the PACF shows reduced spikes, confirming the series is now stationary.
- The series was best modelled using an ARIMA(1,1,0) process: 1 autoregressive term, 1 difference, 0 MA.
- Model parameters were significant and model fit judged by AIC/BIC.
- Comparison to ARIMA(0,1,0) and ARIMA(1,1,1) showed that ARIMA(1,1,0) is most parsimonious for short-term forecasting.
The ARIMA(1,1,0) model was used to forecast cases for 15–21 June 2020.
- Forecast values: 1470, 1468, 1480, 1488, 1497, 1506, 1515
- Upper & lower confidence intervals provided for each date.
- MAE: 365.5
- MSE: 526,452.5
- RMSE: 725.57
- R-squared: 0.878
- Stationary R-squared is low due to non-stationarity of the raw series (corrected by differencing)
- The ARIMA(1,1,0) model tracks the downward trend in daily Covid-19 cases as of June 2020.
- All parameters are statistically significant; diagnostics confirm model adequacy.
- Residuals are uncorrelated and near zero mean, supporting forecast validity.
- Alternative models (ARIMA(0,1,0) and (1,1,1)) were not significantly better or overfit.
The chosen ARIMA(1,1,0) model provides a simple and effective means to forecast daily Covid-19 cases over short windows, supporting public health planning with honest evaluation and error bounds. Forecast accuracy (MAE ≈ 365, RMSE ≈ 726) is reasonable given the data volatility. For future work, incorporating exogenous predictors or using more recent data may further improve accuracy.
Covid19_uk.xlsx
(original data)- SPSS .sav and output files (available upon request)
- All visualizations and tables produced in IBM SPSS