QUICK REVIEW

[论文解读] Augmenting Genetic Algorithms with Deep Neural Networks for Exploring the Chemical Space

AkshatKumar Nigam, Pascal Friederich|arXiv (Cornell University)|Sep 25, 2019

Computational Drug Discovery Methods参考文献 30被引用 94

一句话总结

该论文提出了一种结合神经网络判别器的遗传算法，以促进多样性并引导在化学空间的探索，从而实现最先进的带惩罚项的 logP 分数。

ABSTRACT

Challenges in natural sciences can often be phrased as optimization problems. Machine learning techniques have recently been applied to solve such problems. One example in chemistry is the design of tailor-made organic materials and molecules, which requires efficient methods to explore the chemical space. We present a genetic algorithm (GA) that is enhanced with a neural network (DNN) based discriminator model to improve the diversity of generated molecules and at the same time steer the GA. We show that our algorithm outperforms other generative models in optimization tasks. We furthermore present a way to increase interpretability of genetic algorithms, which helped us to derive design principles.

研究动机与目标

为设计定制有机材料和分子而高效探索化学空间提供动力。
开发一种增强了自适应神经网络判别器的 GA，以提升多样性并引导搜索。
证明 GA-D 在分子优化任务上优于现有生成模型。
展示可解释性优势，揭示高性能分子设计原则。

提出的方法

使用一个以 SELFIES 格式表示分子群体的 GA。
设定适应度 F(m) = J(m) + beta * D(m)，其中 J(m) 是带惩罚的 logP，D(m) 是来自神经网络的判别分数。
在每代结束后用 GA 生成的数据和参考数据训练判别器，以偏向未来的适应度。
依赖 SELFIES 的鲁棒性以允许随机突变而不产生无效结构；包括一个苯基添加的突变规则（约 4%）。
采用随时间变化的自适应惩罚以减轻长期运行中的停滞并促进对不同分子家族的探索。

实验结果

研究问题

RQ1神经网络判别器与 GA 的集成是否能改善化学空间的多样性和探索性？
RQ2与传统 GA 相比，判别器如何影响长期优化和避免停滞？
RQ3GA-D 在带惩罚的 logP 与其他生成模型相比的性能提升有多大？
RQ4该方法是否能产生可解释的高性能分子设计规则？

主要发现

Model	Max. Penalized logP
GVAE + BO (Kusner et al., 2017)	2.87±0.06
VAE (Exploitation) (SD-VAE, 2018)	3.50±0.44
CVAE + BO (Gómez-Bombarelli et al., 2018)	4.85±0.17
ORGAN (Guimaraes et al., 2017)	3.52±0.08
JT-VAE (Jin et al., 2018a)	4.90±0.33
ChemTS (Yang et al., 2017)	5.6±0.5
GCPN (You et al., 2018)	7.87±0.07
Random SELFIES	6.19±0.63
GB-GA (Jensen, 2019) GA	7.4±0.9
GB-GA (Jensen, 2019) GA	15.76±5.71
GA (here)	12.61±0.81
GA + D (here)	13.31±0.63
GA + D(t) (here)	20.72±3.14

GA-D 的最大带惩罚的 logP 分数高于文献中的模型（例如，使用 DNN 时最高可达 20.72 ± 3.14，未使用 D 时为 13.31 ± 0.63）。
基于判别器的自适应惩罚通过惩罚过度代表的高性能家族并鼓励探索来降低停滞。
时间相关的自适应惩罚进一步改善了最大 J(m)，超越了早期方法，显示出强大的探索能力。
K-means 聚类和 PCA 分析揭示了与高 J(m) 相关的设计类别和规则。
该方法支持约束优化以及对数 P 与 QED 的同时优化，显示在不同任务中的多样性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。