QUICK REVIEW

[论文解读] Unlocking the Value of Text: Event-Driven Reasoning and Multi-Level Alignment for Time Series Forecasting

Siyuan Wang, Peng Chen|arXiv (Cornell University)|Mar 16, 2026

Time Series Analysis and Forecasting被引用 0

一句话总结

VoT 通过使用外生和内生文本、事件驱动推理和多层对齐来改进时间序列预测，在10个真实世界数据集上达到最先进的结果。

ABSTRACT

Existing time series forecasting methods primarily rely on the numerical data itself. However, real-world time series exhibit complex patterns associated with multimodal information, making them difficult to predict with numerical data alone. While several multimodal time series forecasting methods have emerged, they either utilize text with limited supplementary information or focus merely on representation extraction, extracting minimal textual information for forecasting. To unlock the Value of Text, we propose VoT, a method with Event-driven Reasoning and Multi-level Alignment. Event-driven Reasoning combines the rich information in exogenous text with the powerful reasoning capabilities of LLMs for time series forecasting. To guide the LLMs in effective reasoning, we propose the Historical In-context Learning that retrieves and applies historical examples as in-context guidance. To maximize the utilization of text, we propose Multi-level Alignment. At the representation level, we utilize the Endogenous Text Alignment to integrate the endogenous text information with the time series. At the prediction level, we design the Adaptive Frequency Fusion to fuse the frequency components of event-driven prediction and numerical prediction to achieve complementary advantages. Experiments on real-world datasets across 10 domains demonstrate significant improvements over existing methods, validating the effectiveness of our approach in the utilization of text. The code is made available at https://github.com/decisionintelligence/VoT.

研究动机与目标

有必要将文本信息引入以应对时间序列中的事件驱动动态。
提出对偶分支的VoT架构，将外生文本与内生文本与时间序列数据融合。
开发基于历史上下文学习的事件驱动推理以引导大模型进行预测。
引入表示层和预测层对齐以最大化跨模态整合。
在多领域展示最先进性能并给出消融分析。

提出的方法

将外生文本与内生文本的双分支架构集成用于预测。
事件驱动推理流水线，包含模板生成、摘要、以及通过历史上下文学习（HIC）增强的Reasoner。
内生文本对齐（ETA），通过解耦模式提取与解耦对比学习将时间序列模式与文本表示对齐。
自适应频率融合（AFF），通过学习的频率感知权重融合事件驱动与数值预测的频率分量。
综合训练目标，结合时间序列监督、跨模态对齐与最终融合损失。
全数据集消融与外生文本对比内生文本影响分析。

实验结果

研究问题

RQ1如何有效利用外生文本与内生文本来提升时间序列预测？
RQ2哪些机制（HIC、ETA、AFF）最能实现跨模态对齐与推理以获得更高预测准确性？
RQ3来自外生文本的事件驱动解释在多领域是否具有可测量的提升？
RQ4与仅使用文本或仅使用时间序列相比，多层对齐如何影响性能？

主要发现

Category	Model	MSE	MAE
Agriculture	VoT	0.209	0.302
Climate	VoT	1.078	0.840
Economy	VoT	0.201	0.353
Energy	VoT	0.222	0.343
Environment	VoT	0.268	0.380
Health	VoT	1.205	0.714
Security	VoT	70.117	3.937
Social Good	VoT	0.804	0.389
Traffic	VoT	0.169	0.232
Weather	VoT	0.968	0.706

VoT 在几乎所有数据集上均取得最佳性能，在所有20个指标上对比仅时间序列的基线和文本增强基线位列第一，对比多模态基线在20个指标中有19项领先。
带有HIC的外生文本驱动事件推理提供显著增益，消融实验显示ETA或HIC移除时性能下降。
内生文本对齐（ETA）通过分解趋势与季节分量并将其与文本表示对齐，提升了表示层面的跨模态对齐。
自适应频率融合（AFF）动态融合事件驱动与数值预测的频率分量，优于固定融合基线。
外生文本的贡献优于随机或基于内生文本的替代方法，体现了真实世界外生信息在事件驱动预测中的价值。
在10个真实世界领域（MM-TSFLib+天气数据集）上的广泛实验显示出普遍性与鲁棒性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。