[论文解读] Reinforced Genetic Algorithm for Structure-based Drug Design
RGA 将强化学习与基于三维靶标-配体结构引导的遗传算法相结合,以改善结构基础药物设计中的对接优化;多靶点的预训练和知识转移提升性能与鲁棒性。
Structure-based drug design (SBDD) aims to discover drug candidates by finding molecules (ligands) that bind tightly to a disease-related protein (targets), which is the primary approach to computer-aided drug discovery. Recently, applying deep generative models for three-dimensional (3D) molecular design conditioned on protein pockets to solve SBDD has attracted much attention, but their formulation as probabilistic modeling often leads to unsatisfactory optimization performance. On the other hand, traditional combinatorial optimization methods such as genetic algorithms (GA) have demonstrated state-of-the-art performance in various molecular optimization tasks. However, they do not utilize protein target structure to inform design steps but rely on a random-walk-like exploration, which leads to unstable performance and no knowledge transfer between different tasks despite the similar binding physics. To achieve a more stable and efficient SBDD, we propose Reinforced Genetic Algorithm (RGA) that uses neural models to prioritize the profitable design steps and suppress random-walk behavior. The neural models take the 3D structure of the targets and ligands as inputs and are pre-trained using native complex structures to utilize the knowledge of the shared binding physics from different targets and then fine-tuned during optimization. We conduct thorough empirical studies on optimizing binding affinity to various disease targets and show that RGA outperforms the baselines in terms of docking scores and is more robust to random initializations. The ablation study also indicates that the training on different targets helps improve performance by leveraging the shared underlying physics of the binding processes. The code is available at https://github.com/futianfan/reinforced-genetic-algorithm.
研究动机与目标
- 通过整合蛋白质结构信息来解决传统GA在结构基础药物设计中的低效和不稳定性。
- 将进化过程重新表述为进化马尔可夫决策过程(EMDP),以使强化学习成为可能。
- 开发靶标-配体等变神经网络,利用3D结构数据引导交叉和变异。
- 在原生蛋白-配体复合物上进行预训练,并实现跨靶点的知识转移,以捕捉共同的结合物理规律。
- 展示在包括SARS-CoV-2主蛋白酶在内的多种疾病靶点上提高的对接分数和鲁棒性。
提出的方法
- 将GA建模为具有种群级状态和对接分数奖励的进化马尔可夫决策过程(EMDP)。
- 使用两个策略网络引导交叉(两步父代选择)和两个策略网络引导变异(父代选择和反应选择)。
- 使用E(3)等变神经网络处理靶标-配体复合物并输出动作概率。
- 在3D结合亲和任务上使用CrossDocked2020数据对ENNs进行预训练,以捕捉共享的结合物理规律,然后在优化过程中进行微调。
- 用策略梯度(REINFORCE)优化策略,以最大化期望的对接分数改进。
- 以AutoDock Vina作为对接预言机,并通过化学意义的单体和双体分子反应设计变异,以确保可合成性。
实验结果
研究问题
- RQ1强化学习引导的GA是否能在对接分数优化方面超越基线的结构基础设计方法?
- RQ2利用靶标结构信息是否能降低随机性并提高多次试验的鲁棒性?
- RQ3在原生复合物上的预训练和跨靶点的知识转移是否能提升SBDD的性能?
- RQ4与仅局部变异的RL方法相比,加入长距离交叉如何影响优化?
主要发现
- RGA在所评估靶点上实现了最佳的TOP-100、TOP-10和TOP-1对接分数。
- RGA在五次独立运行中的方差更小,表明随机游走行为被抑制。
- 在多样靶点上进行知识转移和预训练,进一步提升top-k对接分数的表现。
- 与Autogrow 4.0相比,RGA由于学习到的动作引导和更长距离的导航,提供了更优的对接性能。
- 长距离交叉导航优于仅关注局部修改的RL方法,体现了结构信息驱动搜索的优势。
- 该方法保持具有竞争力的QED和SA分数,表明结构质量和可合成性合理。
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。