QUICK REVIEW

[论文解读] Accuracy and Efficiency Benchmarks of Pretrained Machine Learning Potentials for Molecular Simulations

Peter Eastman, Evan Pretti|arXiv (Cornell University)|Jan 22, 2026

Machine Learning in Materials Science被引用 0

一句话总结

本文对15种预训练的MLIPs在分子模拟中的性能进行基准测试，评估准确性、速度、内存和稳定性，以帮助模型选择。

ABSTRACT

The rapid development of pretrained Machine Learning Interatomic Potentials (MLIPs) that cover a wide range of molecular species has made it challenging to select the best model for a given application. We benchmark 15 pretrained MLIPs, evaluating each one on accuracy, speed, memory use, and ability to produce stable simulations. This provides an objective basis for practitioners to select the most appropriate MLIP for their own simulations, and offers insight into which factors most strongly influence model accuracy. We find that the number of model parameters and the size of the training set are both strongly correlated with accuracy, while training on charged molecules and including explicit Coulomb energy terms are less essential than one might expect. Speed and memory use are determined as much by the model architecture as by the size of the model.

研究动机与目标

有必要在多种分子模拟任务中选择合适的预训练MLIPs，以提高结果可靠性与适用性。
提供一个客观、标准化的多模型基准测试。
识别最影响MLIP准确性与效率的因素。
就在准确性与资源使用之间的权衡，给出实用的模型选择指南。

提出的方法

对15种预训练MLIPs在常见分子模拟任务中进行基准测试。
评估每个模型的准确性、速度、内存使用量和模拟稳定性。
分析模型大小、训练集规模与准确性之间的相关性。
评估显式库仑能项对性能的影响。
描述架构选择如何在不依赖参数数量的情况下影响速度与内存。

实验结果

研究问题

RQ1哪些预训练MLIPs在与其速度和内存成本相比时提供最佳准确性？
RQ2模型大小与训练集规模在跨模型的准确性上有何相关性？
RQ3显式库仑能项是否对这些MLIPs带来可观的收益？
RQ4模型架构选择在多大程度上决定速度和内存使用？

主要发现

模型准确性与参数数量以及训练集规模之间存在强相关性。
显式库仑能项在所评估的模型中并未提供可测量的收益。
速度和内存使用更多地受模型架构影响，而不仅仅是模型大小。
该研究为基于应用特定的准确性与效率需求来选择MLIPs提供了客观依据。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。