[论文解读] Artificial Intelligence Forecasting of Covid-19 in China
本文提出一种受AI启发的方法,使用改进后的堆叠自编码器来预测 COVID-19 在 China 的轨迹,获得多步预测误差在低个位数,并将省份按照传播基础分组。
BACKGROUND An alternative to epidemiological models for transmission dynamics of Covid-19 in China, we propose the artificial intelligence (AI)-inspired methods for real-time forecasting of Covid-19 to estimate the size, lengths and ending time of Covid-19 across China. METHODS We developed a modified stacked auto-encoder for modeling the transmission dynamics of the epidemics. We applied this model to real-time forecasting the confirmed cases of Covid-19 across China. The data were collected from January 11 to February 27, 2020 by WHO. We used the latent variables in the auto-encoder and clustering algorithms to group the provinces/cities for investigating the transmission structure. RESULTS We forecasted curves of cumulative confirmed cases of Covid-19 across China from Jan 20, 2020 to April 20, 2020. Using the multiple-step forecasting, the estimated average errors of 6-step, 7-step, 8-step, 9-step and 10-step forecasting were 1.64%, 2.27%, 2.14%, 2.08%, 0.73%, respectively. We predicted that the time points of the provinces/cities entering the plateau of the forecasted transmission dynamic curves varied, ranging from Jan 21 to April 19, 2020. The 34 provinces/cities were grouped into 9 clusters. CONCLUSIONS The accuracy of the AI-based methods for forecasting the trajectory of Covid-19 was high. We predicted that the epidemics of Covid-19 will be over by the middle of April. If the data are reliable and there are no second transmissions, we can accurately forecast the transmission dynamics of the Covid-19 across the provinces/cities in China. The AI-inspired methods are a powerful tool for helping public health planning and policymaking.
研究动机与目标
- 为在 China 实时 COVID-19 预测提供传统流行病模型的 AI 驱动替代方案。
- 开发一个改进的堆叠自编码器,以基于 2020 年初的数据建模传播动态。
- 利用潜变量表示对省份/城市进行聚类并揭示传播结构。
- 提供累计确诊病例的多步预测,并识别各地区的 plateau(平台期)时机。
提出的方法
- 开发一个改进的堆叠自编码器来建模 COVID-19 传播动力学。
- 将该模型应用于 2020 年 1 月 11 日至 2 月 27 日的中国实时省级数据。
- 利用自编码器的潜变量进行聚类,将省份/城市分组为基于传播的簇。
- 执行多步预测,以预测 2020 年 1 月 20 日至 2020 年 4 月 20 日的累计病例。
- 使用 6-到 10 步预测区间的逐步误差来估计预测准确性。
实验结果
研究问题
- RQ1AI 启发的模型,特别是改进的堆叠自编码器,是否能够在 China 的省份和城市中准确预测 COVID-19 的轨迹?
- RQ2潜变量表示如何捕捉区域传播结构并支持对省份/城市的聚类?
- RQ3在 China 的累计确诊病例上,6–10 步多步预测的预期准确度是多少?
- RQ4在数据可靠的假设下,省份/城市在预测的传播动态中何时进入平台期?
主要发现
- 基于 AI 的方法在多步时间范围内(6 步至 10 步)的平均预测误差较低,分别报告为 1.64%、2.27%、2.14%、2.08%、和 0.73%。
- 基于潜在表示,将 34 个省份/城市分为 9 个簇。
- 该模型对 2020 年 1 月 20 日至 2020 年 4 月 20 日在 China 的累计确诊病例轨迹进行了预测。
- 在数据可靠且无二次传播的前提下,预测疫情可能在四月中旬结束。
- 潜在变量聚类揭示了省份/城市之间的传播结构,支持有针对性的公共卫生规划。
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。