QUICK REVIEW

[論文レビュー] Edge Attention-based Multi-Relational Graph Convolutional Networks

Chao Shang, Qinqing Liu|arXiv (Cornell University)|Feb 14, 2018

Computational Drug Discovery Methods参考文献 34被引用数 68

ひとこと要約

この論文は、結合属性が複数あるグラフを扱う GCN、EAGCN を提案し、結合属性ごとにエッジ注意マトリクスを学習し、分子特性予測のためにノード特徴を集約する。

ABSTRACT

Graph convolutional network (GCN) is generalization of convolutional neural network (CNN) to work with arbitrarily structured graphs. A binary adjacency matrix is commonly used in training a GCN. Recently, the attention mechanism allows the network to learn a dynamic and adaptive aggregation of the neighborhood. We propose a new GCN model on the graphs where edges are characterized in multiple views or precisely in terms of multiple relationships. For instance, in chemical graph theory, compound structures are often represented by the hydrogen-depleted molecular graph where nodes correspond to atoms and edges correspond to chemical bonds. Multiple attributes can be important to characterize chemical bonds, such as atom pair (the types of atoms that a bond connects), aromaticity, and whether a bond is in a ring. The different attributes lead to different graph representations for the same molecule. There is growing interests in both chemistry and machine learning fields to directly learn molecular properties of compounds from the molecular graph, instead of from fingerprints predefined by chemists. The proposed GCN model, which we call edge attention-based multi-relational GCN (EAGCN), jointly learns attention weights and node features in graph convolution. For each bond attribute, a real-valued attention matrix is used to replace the binary adjacency matrix. By designing a dictionary for the edge attention, and forming the attention matrix of each molecule by looking up the dictionary, the EAGCN exploits correspondence between bonds in different molecules. The prediction of compound properties is based on the aggregated node features, which is independent of the varying molecule (graph) size. We demonstrate the efficacy of the EAGCN on multiple chemical datasets: Tox21, HIV, Freesolv, and Lipophilicity, and interpret the resultant attention weights.

研究の動機と目的

複数の結合属性を持つ分子グラフから直接分子特性の学習を動機づける。
バイナリ隣接行列の代わりにエッジ固有の注意マトリクスを用いるGCNの派生形を提案する。
結合および分子間でエッジ注意を共有するための辞書を用いて、分子間の一貫性を可能にする。
標準的な化学データセットでモデルの有効性を示す。
学習された注意重みに対して解釈を行い、結合の重要性への洞察を提供する。

提案手法

各結合属性に対してエッジ注意マトリクスを導入し、バイナリ隣接を置換する。
異なる分子間で対応を可能にするため、エッジ注意をマッピングする辞書を用いる。
グラフ畳み込み内で注意重みとノード特徴を共同学習する。
グラフサイズに依存しない分子レベルの予測を生成するためにノード特徴を集約する。
化学データセットで評価し、性能を検証し、注意を解釈する。

実験結果

リサーチクエスチョン

RQ1複数の結合属性に対するエッジごとの注意は、二値隣接マトリクスのGCNより分子特性予測を改善できるか？
RQ2辞書を介して分子間でエッジ注意を共有することは、学習と一般化にどう影響するか？
RQ3学習された注意重は、性質の結合関連決定要因について解釈可能な洞察を提供するか？
RQ4EAGCN は、手作りのフィンガープリントを用いず、サイズや属性が異なるグラフに対して拡張可能か？

主な発見

EAGCN は化学データセット Tox21、HIV、Freesolv、Lipophilicity で有効性を示す。
このモデルは多関係グラフに対して注意重みとノード特徴を共同で学習する。
注意重は、予測における結合の重要性を理解するために解釈できる。
エッジ注意アプローチは、複数の結合属性を横断する動的で適応的な集約を可能にする。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。