QUICK REVIEW

[论文解读] Scalable Spatio-Temporal SE(3) Diffusion for Long-Horizon Protein Dynamics

Nima Shoghi, Yuxuan Liu|arXiv (Cornell University)|Feb 2, 2026

Protein Structure and Dynamics被引用 0

一句话总结

STAR-MD 是一个包含联合时空注意力的 SE(3)-等变自回归扩散模型，能够在微秒时间尺度上生成物理上合理的蛋白质轨迹，并在长期动力学方面优于先前模型。它在 ATLAS 上实现稳定高质量的回放并能对长时间尺度进行外推。

ABSTRACT

Molecular dynamics (MD) simulations remain the gold standard for studying protein dynamics, but their computational cost limits access to biologically relevant timescales. Recent generative models have shown promise in accelerating simulations, yet they struggle with long-horizon generation due to architectural constraints, error accumulation, and inadequate modeling of spatio-temporal dynamics. We present STAR-MD (Spatio-Temporal Autoregressive Rollout for Molecular Dynamics), a scalable SE(3)-equivariant diffusion model that generates physically plausible protein trajectories over microsecond timescales. Our key innovation is a causal diffusion transformer with joint spatio-temporal attention that efficiently captures complex space-time dependencies while avoiding the memory bottlenecks of existing methods. On the standard ATLAS benchmark, STAR-MD achieves state-of-the-art performance across all metrics--substantially improving conformational coverage, structural validity, and dynamic fidelity compared to previous methods. STAR-MD successfully extrapolates to generate stable microsecond-scale trajectories where baseline methods fail catastrophically, maintaining high structural quality throughout the extended rollout. Our comprehensive evaluation reveals severe limitations in current models for long-horizon generation, while demonstrating that STAR-MD's joint spatio-temporal modeling enables robust dynamics simulation at biologically relevant timescales, paving the way for accelerated exploration of protein function.

研究动机与目标

说明需要生成超出纳秒级别的长时域蛋白质轨迹的动机，并解决现有模型的局限性。
开发一个可扩展的自回归 SE(3) 扩散模型，具有联合时空注意力，以捕捉非可分离的时空动态。
通过区块因果注意力和情境噪声扰动等训练技术，实现长时域回放的稳定性。
展示在 ATLAS 100 ns 轨迹上的最先进性能，并对 240 ns 和 1 μs 时域进行鲁棒外推。
通过 Mori-Zwanzig 形式化理论提供对非马尔可夫性粗粒化动力学及记忆效应的理论支撑。

提出的方法

对 SE(3) 的自回归扩散框架，表示每个残基的平移和旋转。
在残基框架标记上进行联合时空注意力，捕捉不可分离的依赖关系。
采用块状扩散风格的因果训练并使用 KV 缓存以实现长时间域生成的高效性。
训练与推理阶段的情境噪声扰动以减小回放中的误差积累。
通过自适应层归一化实现对多时间尺度的连续时间条件化。
以 Mori-Zwanzig 形式化为基础，证明历史依赖性和不可分离记忆核的合理性。

实验结果

研究问题

RQ1STAR-MD 是否能够在微秒时间尺度上为大型蛋白生成物理上合理的轨迹？
RQ2联合时空注意力是否在捕捉耦合的时空动态方面优于先验的先时后空架构？
RQ3在仅用 100 ns 数据训练的情况下，STAR-MD 对更长的时域（如 240 ns 和 1 μs）外推的表现如何？
RQ4哪些架构与训练选择能显著提升长时域的稳定性和结构有效性？
RQ5与最先进的轨迹模型相比，STAR-MD 在构象覆盖、有效性和动态保真度方面的表现如何？

主要发现

STAR-MD 在 ATLAS 100 ns 基准测试中在构象覆盖、结构有效性和动态保真度方面达到最先进的性能。
STAR-MD 在长时域回放（长达 1 μs）中保持高结构有效性和广泛的构象覆盖，优于在长期回放中易退化的基线模型。
联合时空注意力与带情境噪声的自回归扩散能够实现稳健的长时域生成并减少误差累积。
STAR-MD 能以有竞争力的质量指标外推至 240 ns 和 1 μs 轨迹，而替代模型退化更快。
理论分析将非马尔可夫粗粒化动力学与记忆需求（历史相关核）联系起来，正当化模型设计。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。