QUICK REVIEW

[论文解读] Global Optimization of Atomic Clusters via Physically-Constrained Tensor Train Decomposition

Konstantin Sozykin, Nikita Rybin|arXiv (Cornell University)|Jan 26, 2026

Machine Learning in Materials Science被引用 0

一句话总结

该论文提出一个基于 TT 的框架，结合代数（TTOpt）与概率（PROTES）策略，通过利用势能面的低秩张量表示，对原子簇进行全局优化，并采用物理约束编码。

ABSTRACT

The global optimization of atomic clusters represents a fundamental challenge in computational chemistry and materials science due to the exponential growth of local minima with system size (i.e., the curse of dimensionality). We introduce a novel framework that overcomes this limitation by exploiting the low-rank structure of potential energy surfaces through Tensor Train (TT) decomposition. Our approach combines two complementary TT-based strategies: the algebraic TTOpt method, which utilizes maximum volume sampling, and the probabilistic PROTES method, which employs generative sampling. A key innovation is the development of physically-constrained encoding schemes that incorporate molecular constraints directly into the discretization process. We demonstrate the efficacy of our method by identifying global minima of Lennard-Jones clusters containing up to 45 atoms. Furthermore, we establish its practical applicability to real-world systems by optimizing 20-atom carbon clusters using a machine-learned Moment Tensor Potential, achieving geometries consistent with quantum-accurate simulations. This work establishes TT-decomposition as a powerful tool for molecular structure prediction and provides a general framework adaptable to a wide range of high-dimensional optimization problems in computational material science.

研究动机与目标

动机并解决能量景观具有指数级大量局部极小值的原子簇的全局优化问题。
提出一个基于 TT 的框架，将能量面表示为低秩张量，以缓解维数灾难。
开发物理约束的编码方案，将分子约束纳入离散化。
结合代数和概率的 TT 基优化方法，以识别近似全局极小值。
在 Lennard-Jones 簇，原子数最多为 45 个以及使用机器学习势的碳簇上展示有效性。

提出的方法

将离散网格上的聚集能量 E(x) 表示为 d 维张量 E，并采用 TT-format 以实现低秩存储与运算。
使用两种 TT 基优化策略：TTOpt（基于 maxvol 的代数优化）和 PROTES（基于 TT 的密度估计的概率抽样）。
TTOpt 处理张量展开，并迭代选择最大体积的子矩阵以定位近全局极小值，可能具有自适应秩。
PROTES 构建一个 TT 分解的概率分布 p_theta(n)，其与平方的 TT 张量成正比，以对候选构型进行采样，然后使用前 k 个能量候选通过 SGD 更新 TT 核。
通过多种离散化方案（Direct、Relative、Bit 编码）对粒子坐标进行编码，以纳入平移/旋转不变性和物理间距约束。
结合 PROTES 和各种编码方案的物理约束初始化，用以确保现实且不重叠的构型。

实验结果

研究问题

RQ1张量-Train 表示是否能捕获原子簇高维势能面的低秩结构？
RQ2TTOpt 与 PROTES 是否在崎岖能量景观的全局优化中提供互补的优势？
RQ3物理约束编码方案如何影响 TT 基优化中的搜索效率和解的质量？
RQ4该框架在 Lennard-Jones 簇（最多 45 个原子）以及用机器学习势建模的碳簇上是否有效？
RQ5TT 基极小值相对于量子精确模拟或高保真势的准确性如何？

主要发现

使用所提出的 TT 基框架识别出 Lennard-Jones 簇中最多 45 原子的全局极小值。
当使用 machine-learned Moment Tensor Potential 时，该方法获得的 20 原子碳簇几何构型与量子精确模拟一致。
代数型 TTOpt 与概率型 PROTES 的结合利用低秩结构和信息几何来应对高维优化。
物理约束编码有效地缩小搜索空间并强制合理的原子间距离和角度构型。
TT 分解使得计算材料科学中的高维优化问题的存储与计算更加高效。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。