QUICK REVIEW

[論文レビュー] Artificial Intelligence Assists Discovery of Reaction Coordinates and Mechanisms from Molecular Dynamics Simulations

Hendrik Jung, Roberto Covino|arXiv (Cornell University)|Jan 14, 2019

Gaussian Processes and Bayesian Inference参考文献 2被引用数 39

ひとこと要約

この論文は、適応サンプリング、ニューラルネットワーク、シンボリック回帰を用いて反応座標を明らかにする MD シミュレーションのサンプリングを導くAI支援フレームワークを提示し、MD シミュレーションから分子機構を抽出する。解釈性と alanine dipeptide や LiCl のようなモデル系への適用性を実証する。

ABSTRACT

Exascale computing holds great opportunities for molecular dynamics (MD) simulations. However, to take full advantage of the new possibilities, we must learn how to focus computational power on the discovery of complex molecular mechanisms, and how to extract them from enormous amounts of data. Both aspects still rely heavily on human experts, which becomes a serious bottleneck when a large number of parallel simulations have to be orchestrated to take full advantage of the available computing power. Here, we use artificial intelligence (AI) both to guide the sampling and to extract the relevant mechanistic information. We combine advanced sampling schemes with statistical inference, artificial neural networks, and deep learning to discover molecular mechanisms from MD simulations. Our framework adaptively and autonomously initializes simulations and learns the sampled mechanism, and is thus suitable for massively parallel computing architectures. We propose practical solutions to make the neural networks interpretable, as illustrated in applications to molecular systems.

研究の動機と目的

実機を用いた機械的機序の発見における人間のボトルネックを削減して exascale MD を動機づける。
Massively parallel computing のために、シミュレーションを自動的に初期化し、サンプルされた機構を学習する AI フレームワークを開発する。
高度なサンプリング、統計推論、ニューラルネットワーク、シンボリック回帰を組み合わせて、解釈可能な反応座標を特定する。
MD コンテキストにおけるニューラルネットワークの解釈性を高める実践的アプローチを提供する。

提案手法

Metropolis-Hastings の受理を用いた遷移経路サンプリングを活用し、シュート配置を導く。
未知の反応座標 q(x) を Eq. (3) のようにディープニューラルネットワークで表現する。
ANN を訓練して反応座標を定義する入力座標を同定する。
differentiable Cartesian genetic programming（symbolic regression）を適用し、訓練済みの ANN を明示的な式で近似する。
シンボリック回帰中のモデル複雑さを制御し過剰適合を避けるための正則化項を組み込む。
分子系における反応座標の解釈可能な式を示す。

実験結果

リサーチクエスチョン

RQ1AI ガイド付きサンプリングは MD シミュレーションにおける遷移経路生成と収束をどのように加速できるか。
RQ2どの入力座標がさまざまな系を横断して反応座標を最も強く定義するか。
RQ3symbolic regression はニューラルネットワーク由来の反応座標を近似するコンパクトで解釈可能な式を生み出せるか。
RQ4AI フレームワークは alanine dipeptide や LiCl のモデル系で標準 TPS と比較してどう機能するか。
RQ5この MD コンテキストにおけるニューラルネットワークの解釈性を確保する実践的戦略は何か。

主な発見

AI 助ける MD は tested runs（alanine dipeptide, LiCl）において標準 TPS と比較して遷移経路の累積生成と受理を増加させる。
フレームワークは長いコントロールTPS のベースラインと比べTP 時間収束を加速する。
入力の関連性分析は反応座標を支配的に定義する小さな座標集合を同定する（例：alanine dipeptide の特定の二重結合回転成分など）。
シンボリック回帰は正則化の下で ANN 由来の反応座標を近似するコンパクトな式を生み出す。
モデル系は、解釈可能な q(x) 式が回収可能であることを示す（例：二面角項とロジスティック/ln 成分を含む q_SR 式）。
このアプローチは自動初期化と学習をサポートし、大規模並列計算アーキテクチャに適している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。