QUICK REVIEW

[論文レビュー] Improved motif-scaffolding with SE(3) flow matching

Jason Yim, Andrew M. Campbell|PubMed|Jan 8, 2024

Protein Structure and Dynamics参考文献 15被引用数 8

ひとこと要約

本論文は FrameFlow をモチーフ条件付けのアモルタイズメントとモチーフガイダンスで拡張し、モチーフスキャフォルディングを実行する。RFdiffusionと比較して設計性を保ちつつ骨格の多様性を高めている。

ABSTRACT

Protein design often begins with the knowledge of a desired function from a motif which motif-scaffolding aims to construct a functional protein around. Recently, generative models have achieved breakthrough success in designing scaffolds for a range of motifs. However, generated scaffolds tend to lack structural diversity, which can hinder success in wet-lab validation. In this work, we extend FrameFlow, an SE(3) flow matching model for protein backbone generation, to perform motif-scaffolding with two complementary approaches. The first is motif amortization, in which FrameFlow is trained with the motif as input using a data augmentation strategy. The second is motif guidance, which performs scaffolding using an estimate of the conditional score from FrameFlow without additional training. On a benchmark of 24 biologically meaningful motifs, we show our method achieves 2.5 times more designable and unique motif-scaffolds compared to state-of-the-art. Code: https://github.com/microsoft/protein-frame-flow.

研究の動機と目的

Motifを拡張して SE(3) flow matching を二つの戦略：motif amortization と motif guidance に拡張して、モチーフスキャフォルディングを改善する。
条件付き（アモルタイズド）と無条件（ガイダンス）アプローチを PDB ベースのモチーフスキャフォルディングベンチマークで比較する。
FrameFlow バリアントが RFdiffusion より等価または優れた設計性とより高いモチーフスキャフォルディングの多様性を達成することを示す。
軽量な FrameFlow モデルが現在の最先端手法よりもパラメータ数とトレーニングリソースを少なくて済むことを示す。

提案手法

FrameFlow を moti f conditioning して与えられたモチーフの周りにスキャフォルドを生成する（motif amortization）。
追加学習なしで無条件 FrameFlow モデルを用いてモチーフをサンプリング軌道に条件付けすることで motif guidance を開発する。
翻訳：SE(3) バックボーン表現を Riemannian flow matching により、平行移動と SO(3) 回転の分解ベクトル場で表現する。
無 label の PDB からのモチーフ分布を模擬する motif data augmentation で motif-amortized FrameFlow を訓練する。
生成には 500 timesteps Euler-Maruyama sampling を使用し、設計性と多様性の指標で評価する。
モチーフスキャフォルディングベンチマークで RFdiffusion と TDS と比較する。

Figure 1: We present two strategies for motif-scaffolding. Top : motif amortization trains a flow model to condition on the motif (blue) and generate the scaffold (red). During training, only the scaffold is corrupted with noise. Bottom : motif guidance re-purposes a flow model that is trained to ge

実験結果

リサーチクエスチョン

RQ1motif amortization または motif guidance は SE(3) flow matching での以前の最先端手法（RFdiffusion）よりモチーフスキャフォルディング性能を改善するか。
RQ2条件付き（amortized）と無条件（guidance）アプローチは設計性と骨格多様性で差があるか。
RQ3FrameFlow は軽量で訓練可能なまま、モチーフごとにより多くの独自で設計性のあるスキャフォールドを提供するか。
RQ4無条件のバックボーン結果は FrameFlow-guidance によるモチーフスキャフォルディングの信頼性をどう裏付けるか。

主な発見

方法	設計性 (↑)	多様性 (↑)	新規性 (↓)
FrameFlow	0.80	171	0.61
RFdiffusion	0.87	156	0.64

モチーフアモルタイズを用いる FrameFlow はベンチマークで 21 モチーフを解決、RFdiffusion は 20。
FrameFlow-guidance は 20 モチーフを解決し、RFdiffusion と同等の性能。
モチーフごとに見ると FrameFlow 手法は RFdiffusion より独自の設計可能な骨格を 2.5 倍多く生み出す。
無条件 FrameFlow は RFdiffusion と同等の設計性を保ちながら、より高い多様性と新規性を達成。
FrameFlow 条件付きアプローチはネットワークを 3x 小型化（16.8M vs 59.8M パラメータ）で事前学習なし。
FrameFlow を用いたモチーフスキャフォルディングは RFdiffusion および TDS より総合的に骨格の多様性を拡大する。

Figure 2: Motif data augmentation. Each protein in the dataset does not come with pre-defined motif-scaffold annotations. Instead, we construct plausible motifs at random to simulate sampling from the distribution of motifs and scaffolds.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。