QUICK REVIEW

[논문 리뷰] Learning Multimodal Graph-to-Graph Translation for Molecular Optimization

Wengong Jin, Kevin Yang|arXiv (Cornell University)|2018. 12. 03.

Machine Learning in Materials Science참고 문헌 51인용 수 154

한 줄 요약

논문은 junction-tree 인코더-디코더와 잠재 코드 및 적대적 scaffold 정규화를 사용하여 분자 최적화를 위한 다중모드 그래프-투-그래프 변환 모델을 제안하고, 여러 작업에서 state-of-the-art 결과와 다양한 출력을 달성한다.

ABSTRACT

We view molecular optimization as a graph-to-graph translation problem. The goal is to learn to map from one molecular graph to another with better properties based on an available corpus of paired molecules. Since molecules can be optimized in different ways, there are multiple viable translations for each input graph. A key challenge is therefore to model diverse translation outputs. Our primary contributions include a junction tree encoder-decoder for learning diverse graph translations along with a novel adversarial training method for aligning distributions of molecules. Diverse output distributions in our model are explicitly realized by low-dimensional latent vectors that modulate the translation process. We evaluate our model on multiple molecular optimization tasks and show that our model outperforms previous state-of-the-art baselines.

연구 동기 및 목표

Motivate molecular optimization as a graph-to-graph translation problem with parallel data.
Enable diverse (multimodal) outputs by injecting low-dimensional latent codes into decoding.
Ensure chemical validity and target-domain alignment via adversarial scaffold regularization.
Leverage a junction-tree based encoder-decoder to hierarchically generate valid molecular graphs.
Demonstrate improvements over baselines across multiple property optimization tasks.

제안 방법

Encode molecules with a shared junction-tree and graph encoder using graph message passing.
Decode first a junction tree with a tree-GRU and attention to predict scaffolds, then assemble the graph from clusters.
Introduce low-dimensional latent codes z to capture multimodal translations and train with a variational objective.
Apply adversarial regularization on continuous scaffold representations to align translated outputs with the target domain.
Train a graph decoder with a contextual loss (Eq. 10) and utilize an adversarial (WGAN-GP) objective over decoder embeddings to enforce scaffold realism.
Optionally, use a VAE-like objective with a KL term to encourage latent codes to follow a prior.

실험 결과

연구 질문

RQ1Can a multimodal graph-to-graph translation framework learn diverse, property-improving molecular mappings from paired data?
RQ2Does a junction-tree based encoder-decoder improve validity and quality of generated molecules compared to sequence-based or flat graph approaches?
RQ3Do latent codes enable meaningful diversity without sacrificing translation accuracy or chemical validity?
RQ4Does adversarial scaffold regularization help align translated molecules with the target domain distributions?
RQ5How does the proposed method compare to MMPA, JT-VAE, GCPN, and VSeq2Seq baselines on penalized logP, QED, and DRD2 optimization tasks?

주요 결과

The model outperforms baselines on penalized logP, achieving higher improvements and diverse outputs across two similarity thresholds.
On QED and DRD2 tasks, the method (VJTNN and VJTNN+GAN) shows higher success rates and competitive diversity and novelty metrics compared to baselines.
VJTNN+GAN provides marginal diversity gains over VJTNN, with strong performance on QED and DRD2 under explicit target domains.
The approach yields diverse and novel molecules while leveraging parallel data for sample efficiency beyond MMPA rules-based methods.
A junction-tree based encoder-decoder with latent multimodal decoding attains superior translation accuracy and property improvements.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.