QUICK REVIEW

[论文解读] Artificial Intelligence Forecasting of Covid-19 in China

Zixin Hu, Qiyang Ge|arXiv (Cornell University)|Feb 17, 2020

COVID-19 epidemiological studies参考文献 7被引用 118

一句话总结

本文提出一种受AI启发的方法，使用改进后的堆叠自编码器来预测 COVID-19 在 China 的轨迹，获得多步预测误差在低个位数，并将省份按照传播基础分组。

ABSTRACT

BACKGROUND An alternative to epidemiological models for transmission dynamics of Covid-19 in China, we propose the artificial intelligence (AI)-inspired methods for real-time forecasting of Covid-19 to estimate the size, lengths and ending time of Covid-19 across China. METHODS We developed a modified stacked auto-encoder for modeling the transmission dynamics of the epidemics. We applied this model to real-time forecasting the confirmed cases of Covid-19 across China. The data were collected from January 11 to February 27, 2020 by WHO. We used the latent variables in the auto-encoder and clustering algorithms to group the provinces/cities for investigating the transmission structure. RESULTS We forecasted curves of cumulative confirmed cases of Covid-19 across China from Jan 20, 2020 to April 20, 2020. Using the multiple-step forecasting, the estimated average errors of 6-step, 7-step, 8-step, 9-step and 10-step forecasting were 1.64%, 2.27%, 2.14%, 2.08%, 0.73%, respectively. We predicted that the time points of the provinces/cities entering the plateau of the forecasted transmission dynamic curves varied, ranging from Jan 21 to April 19, 2020. The 34 provinces/cities were grouped into 9 clusters. CONCLUSIONS The accuracy of the AI-based methods for forecasting the trajectory of Covid-19 was high. We predicted that the epidemics of Covid-19 will be over by the middle of April. If the data are reliable and there are no second transmissions, we can accurately forecast the transmission dynamics of the Covid-19 across the provinces/cities in China. The AI-inspired methods are a powerful tool for helping public health planning and policymaking.

研究动机与目标

为在 China 实时 COVID-19 预测提供传统流行病模型的 AI 驱动替代方案。
开发一个改进的堆叠自编码器，以基于 2020 年初的数据建模传播动态。
利用潜变量表示对省份/城市进行聚类并揭示传播结构。
提供累计确诊病例的多步预测，并识别各地区的 plateau（平台期）时机。

提出的方法

开发一个改进的堆叠自编码器来建模 COVID-19 传播动力学。
将该模型应用于 2020 年 1 月 11 日至 2 月 27 日的中国实时省级数据。
利用自编码器的潜变量进行聚类，将省份/城市分组为基于传播的簇。
执行多步预测，以预测 2020 年 1 月 20 日至 2020 年 4 月 20 日的累计病例。
使用 6-到 10 步预测区间的逐步误差来估计预测准确性。

实验结果

研究问题

RQ1AI 启发的模型，特别是改进的堆叠自编码器，是否能够在 China 的省份和城市中准确预测 COVID-19 的轨迹？
RQ2潜变量表示如何捕捉区域传播结构并支持对省份/城市的聚类？
RQ3在 China 的累计确诊病例上，6–10 步多步预测的预期准确度是多少？
RQ4在数据可靠的假设下，省份/城市在预测的传播动态中何时进入平台期？

主要发现

基于 AI 的方法在多步时间范围内（6 步至 10 步）的平均预测误差较低，分别报告为 1.64%、2.27%、2.14%、2.08%、和 0.73%。
基于潜在表示，将 34 个省份/城市分为 9 个簇。
该模型对 2020 年 1 月 20 日至 2020 年 4 月 20 日在 China 的累计确诊病例轨迹进行了预测。
在数据可靠且无二次传播的前提下，预测疫情可能在四月中旬结束。
潜在变量聚类揭示了省份/城市之间的传播结构，支持有针对性的公共卫生规划。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。