QUICK REVIEW

[論文レビュー] A theoretical interpretation of variance-based convergence criteria in perturbation-based theories

Xiao-Hui Wang, Zhaoxi Sun|arXiv (Cornell University)|Feb 22, 2018

Advanced Thermodynamics and Statistical Mechanics参考文献 68被引用数 27

ひとこと要約

本論文は、摂動に基づく自由エネルギー計算における分散ベースの収束基準の理論的基盤を提供し、FEP や BAR などの推定器が自由エネルギー推定の分散に非線形に依存することを示している。分散ベースの収束指標は、標本サイズの制限と分布の仮定に起因して本質的にバイアスを有することを証明しており、不確実性が系統的に低く見積もられることにつながる。これは、実際の応用において標準誤差の推定が意味を持たないことを意味する。

ABSTRACT

In QM/MM indirect free energy simulation, QM/MM corrections can be obtained from integration of partial derivatives of alchemical Hamiltonians or from perturbation-based estimators including free energy perturbation (FEP) and acceptance ratio methods. With FEP or exponential averaging, researchers tend to only sample MM states and calculate single point energy to get the free energy estimates. In this case the sample size hysteresis arises and the convergence is determined by bias elimination rather than variance minimization. Various criteria are proposed to evaluate the convergence issue and numerical studies are reported. It has been found that criteria including variance of distribution, effective sample size, information entropies and so on can be used and they are variance-of-distribution-dependent. However, no theoretical interpretation is presented. In this paper we present theoretical interpretations to dig the underlying statistical nature behind the problem. The convergence criteria are proven to be related with variance of distribution in Gaussian approximated Exponential averaging. Further, we prove that these estimators are nonlinearly dependent on the variance of the free energy estimate. As these estimators are often orders of magnitude overestimated, the variance of the FEP estimate is orders of magnitude underestimated. Hence, computing this statistical uncertainty is meaningless. In numerical calculation from timeseries data the effective sample size is bounded by 1 and N and thus the variance of the free energy estimate is proven to be bounded by 0 and 1 (kBT)2 for EXP and 0 and 2 (kBT)2 for BAR, which indicates an inevitable underestimation. Specifically, the upper bounds for these estimators are sample-size dependent. The effective sample size is proven to be a function of the overlap scalar, from which the range of the overlap scalar can also be derived.

研究の動機と目的

摂動に基づく自由エネルギー手法で用いられる分散ベースの収束基準の統計的性質を理論的に解釈すること。
FEP や BAR 計算で用いられる分散推定器における系統的バイアスの根本的要因を特定すること。
有効標本サイズの上限と分布の仮定に起因して、自由エネルギー推定における標準誤差の推定が本質的に信頼できないことを示すこと。
一般的に用いられる収束指標（分散、有効標本サイズ、エントロピーなど）が実際にはなぜ誤解を招くのかを明確にすること。
指数関数的平均化および受容率平均化の下での自由エネルギー推定の分散に対する理論的上限を導出すること。

提案手法

ガウス近似下での指数平均化における自由エネルギー推定の分散と収束基準の理論的関係を導出する。
FEP および BAR 推定器が自由エネルギー推定の分散に非線形に依存することを証明し、精度の過大評価を引き起こすことを示す。
時系列データを分析して、有効標本サイズが 1 から N の間で制限されることを示し、EXP では自由エネルギー推定の分散が 0 から (kBT)² の間、BAR では 0 から 2(kBT)² の間で制限されることを示す。
有効標本サイズを決定づける重要なパラメータとしてのオーバーラップスカラーを導入し、その理論的範囲を導出する。
数学的導出と漸近的解析を用いて、自由エネルギー推定器における統計的不確実性の上界を確立する。
数値的図版（最終版で追加された新しい図 7 を含む）を用いて理論的予測の妥当性を検証する。

実験結果

リサーチクエスチョン

RQ1なぜ FEP や BAR における分散ベースの収束基準は、真の統計的不確実性を正しく反映しないことが多いのか？
RQ2実際の収束基準と自由エネルギー推定の分散との間には、理論的にどのような関係があるのか？
RQ3時系列データにおける有効標本サイズは、自由エネルギー推定の分散をどのように制限するのか？
RQ4なぜ摂動に基づく手法における自由エネルギー推定の標準誤差は、系統的に低く見積もられるのか？
RQ5オーバーラップスカラーは、収束基準の信頼性を決定づける上で果たす役割は何か？

主な発見

指数平均化における自由エネルギー推定の分散は、0 から (kBT)² の間で制限され、真の不確実性はこの上限を超えることはない。
BAR 法においては、自由エネルギー推定の分散が 0 から 2(kBT)² の間で制限されることを示し、統計的不確実性に根本的な上限があることを示している。
時系列データにおける有効標本サイズは 1 から N の間で制限され、これが分散ベースの収束基準の精度を本質的に制限する。
オーバーラップスカラーは有効標本サイズの範囲を決定づけ、その理論的範囲は解析から導出された。
分散ベースの収束基準は真の分散に非線形に依存しており、精度の過大評価と実際の不確実性の著しい低減評価を引き起こす。
その結果、これらの推定器から標準誤差を計算することは、系統的バイアスと制限された分散のため、統計的に意味を持たない。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。