QUICK REVIEW

[Paper Review] Deep Graph Contrastive Representation Learning

Yanqiao Zhu, Yichen Xu | arXiv (Cornell University) | 2020-06-07

Advanced Graph Neural Networks | References: 44 | Citations: 412

One-Line Summary

GRACE learns unsupervised node embeddings by contrasting node representations across two corrupted graph views, improving over prior methods and even rivaling supervised approaches on transductive tasks.

ABSTRACT

Graph representation learning nowadays becomes fundamental in analyzing graph-structured data. Inspired by recent success of contrastive methods, in this paper, we propose a novel framework for unsupervised graph representation learning by leveraging a contrastive objective at the node level. Specifically, we generate two graph views by corruption and learn node representations by maximizing the agreement of node representations in these two views. To provide diverse node contexts for the contrastive objective, we propose a hybrid scheme for generating graph views on both structure and attribute levels. Besides, we provide theoretical justification behind our motivation from two perspectives, mutual information and the classical triplet loss. We perform empirical experiments on both transductive and inductive learning tasks using a variety of real-world datasets. Experiments demonstrate that despite its simplicity, our proposed method consistently outperforms existing state-of-the-art methods by large margins. Moreover, our unsupervised method even surpasses its supervised counterparts on transductive tasks, demonstrating its great potential in real-world applications.

Research Motivation and Goals

Motivate unsupervised graph representation learning without relying on graph proximity reconstructions or injective readout constraints.
Propose a node-level contrastive framework that maximizes agreement across two corrupted graph views.
Develop dual graph view generation via topology and attribute corruption (edge removal and feature masking).
Provide theoretical connections to mutual information (InfoMax) and triplet loss.
Empirically validate on transductive and inductive node classification across multiple datasets.

Proposed Method

Use a GNN encoder to produce node embeddings from two corrupted graph views.
Generate two views by removing edges (structure corruption) and masking features (attribute corruption).
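Apply a contrastive loss that pulls together the two embeddings of the same node across views and pushes apart the embeddings of all other nodes, both across views and within the same view.
Pass embeddings through a two-layer MLP projection g before computing the similarity score that discriminates positive from negative node pairs.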
Optimize the average contrastive objective across all nodes, without relying on an explicit graph-level readout.
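The steps listed above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`drop_edges`, `mask_features`, `grace_loss`), the temperature `tau`, the choice of cosine similarity, and the `(2, E)` edge-list layout are all choices made for this sketch. The GNN encoder and the projection g are omitted; `u` and `v` stand in for already-projected node embeddings from the two views.

```python
import numpy as np


def drop_edges(edge_index, p_drop, rng):
    """Structure corruption: keep each edge with probability 1 - p_drop.

    edge_index is assumed to be a (2, E) integer array of (source, target) pairs.
    """
    keep = rng.random(edge_index.shape[1]) >= p_drop
    return edge_index[:, keep]


def mask_features(x, p_mask, rng):
    """Attribute corruption: zero out whole feature columns with probability p_mask."""
    keep = (rng.random(x.shape[1]) >= p_mask).astype(x.dtype)
    return x * keep  # one mask vector, broadcast over all nodes


def cosine_sim(a, b):
    """Pairwise cosine similarity between the rows of a and the rows of b."""
    a = a / np.maximum(np.linalg.norm(a, axis=1, keepdims=True), 1e-12)
    b = b / np.maximum(np.linalg.norm(b, axis=1, keepdims=True), 1e-12)
    return a @ b.T


def grace_loss(u, v, tau=0.5):
    """Node-level contrastive loss, averaged over nodes and both view directions.

    u, v: (N, d) arrays; row i of each is the same node seen in a different view.
    tau is an assumed temperature hyperparameter.
    """
    def one_side(a, b):
        between = np.exp(cosine_sim(a, b) / tau)  # inter-view similarities
        within = np.exp(cosine_sim(a, a) / tau)   # intra-view similarities
        pos = np.diag(between)                    # same node, other view
        # Denominator: positive pair + inter-view negatives + intra-view negatives
        # (the self-similarity on the diagonal of `within` is excluded).
        denom = between.sum(axis=1) + within.sum(axis=1) - np.diag(within)
        return -np.log(pos / denom)

    return float(0.5 * (one_side(u, v) + one_side(v, u)).mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(5, 8))                        # toy node features
    edges = np.array([[0, 1, 2, 3], [1, 2, 3, 4]])     # toy edge list
    view1 = (mask_features(x, 0.3, rng), drop_edges(edges, 0.2, rng))
    view2 = (mask_features(x, 0.4, rng), drop_edges(edges, 0.4, rng))
    # Random arrays stand in for encoder + projection outputs on each view.
    u, v = rng.normal(size=(5, 4)), rng.normal(size=(5, 4))
    print(grace_loss(u, v))
```

In a full pipeline, a shared encoder would be applied to `view1` and `view2`, and the loss would be minimized by gradient descent in an autodiff framework; NumPy is used here only to keep the arithmetic visible.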

Experimental Results

Research Questions

RQ1: Can node-level contrastive learning on two graph views yield strong unsupervised node representations?
RQ2: Do dual corruption strategies (structure and feature masking) provide diverse contexts to improve learning?
RQ3: How does GRACE relate to mutual information maximization and the triplet loss framework?
RQ4: How does GRACE perform on standard transductive and inductive graph datasets compared to existing unsupervised and supervised methods?

Key Results

Transductive datasets
Method	Training data	Cora	Citeseer	Pubmed	DBLP
Raw features	X	64.8	64.6	84.8	71.6
node2vec	A	74.8	52.3	80.3	78.8
DeepWalk	A	75.7	50.5	80.5	75.9
DeepWalk + features	X,A	73.1	47.6	83.7	78.1
GAE	X,A	76.9	60.6	82.9	81.2
VGAE	X,A	78.9	61.2	83.0	81.7
DGI	X,A	82.6 ±0.4	68.8 ±0.7	86.0 ±0.1	83.2 ±0.1
GRACE	X,A	83.3 ±0.4	72.1 ±0.5	86.7 ±0.1	84.2 ±0.1
SGC	X,A,Y	80.6	69.1	84.8	81.7
GCN	X,A,Y	82.8	72.0	84.9	82.7

Inductive datasets
Method	Training data	Reddit	PPI
Raw features	X	58.5	42.2
DeepWalk	A	32.4	—
DeepWalk + features	X,A	69.1	—
GraphSAGE-GCN	X,A	90.8	46.5
GraphSAGE-mean	X,A	89.7	48.6
GraphSAGE-LSTM	X,A	90.7	48.2
GraphSAGE-pool	X,A	89.2	50.2
DGI	X,A	94.0 ±0.1	63.8 ±0.2
GRACE	X,A	94.2 ±0.0	66.2 ±0.1
FastGCN	X,A,Y	93.7	—
GaAN-mean	X,A,Y	95.8 ±0.1	96.9 ±0.2

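GRACE achieves the best results among the listed unsupervised methods on all six datasets.
On transductive tasks, GRACE surpasses DGI and the other unsupervised baselines and matches or exceeds the supervised GCN and SGC on all four datasets (e.g., 83.3 vs. 82.8 for GCN on Cora, 72.1 vs. 72.0 on Citeseer).
On inductive tasks, GRACE outperforms all listed unsupervised baselines. On Reddit it is competitive with supervised models (94.2 vs. 93.7 for FastGCN and 95.8 for GaAN-mean), while on PPI a large gap to the supervised GaAN-mean remains (66.2 vs. 96.9).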
Theoretical analysis shows GRACE maximizes a lower bound on mutual information between input features and node embeddings in two views and relates to the triplet loss.
GRACE remains robust to sparse features and benefits from corruption at both topology and feature levels.
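The mutual-information claim in the list above can be written out explicitly. The notation below is an assumption of this sketch, following the paper's formulation as best recalled rather than anything stated in this review: u_i and v_i are the embeddings of node i in the two views, θ(u, v) is the similarity of the projected embeddings g(u) and g(v), τ is a temperature, and X, U, V denote the input features and the embeddings of the two views.

```latex
% Pairwise objective for node i: one positive pair,
% inter-view negatives, and intra-view negatives.
\ell(\mathbf{u}_i, \mathbf{v}_i) =
  \log \frac{e^{\theta(\mathbf{u}_i, \mathbf{v}_i)/\tau}}
            {e^{\theta(\mathbf{u}_i, \mathbf{v}_i)/\tau}
             + \sum_{k \neq i} e^{\theta(\mathbf{u}_i, \mathbf{v}_k)/\tau}
             + \sum_{k \neq i} e^{\theta(\mathbf{u}_i, \mathbf{u}_k)/\tau}}

% Overall objective (to be maximized), symmetric in the two views.
\mathcal{J} = \frac{1}{2N} \sum_{i=1}^{N}
  \left[ \ell(\mathbf{u}_i, \mathbf{v}_i) + \ell(\mathbf{v}_i, \mathbf{u}_i) \right]

% Lower-bound relation to mutual information.
\mathcal{J} \le I(\mathbf{X}; \mathbf{U}, \mathbf{V})
```

Read this way, raising the objective tightens a lower bound on the information the two views' embeddings jointly carry about the input features, which is the InfoMax connection the review refers to.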

This review was generated by AI and reviewed by a human editor.