QUICK REVIEW

[論文レビュー] Data Augmentation for Deep Graph Learning: A Survey

Kaize Ding, Zhe Xu|arXiv (Cornell University)|Feb 16, 2022

Advanced Graph Neural Networks被引用数 27

ひとこと要約

本調査はグラフデータ拡張（GraphDA）を形式化し、拡張モダリティ別の分類を提供し、GraphDAの技術と低リソースおよび信頼性の高いグラフ学習への適用をレビューする。

ABSTRACT

Graph neural networks, a powerful deep learning tool to model graph-structured data, have demonstrated remarkable performance on numerous graph learning tasks. To address the data noise and data scarcity issues in deep graph learning, the research on graph data augmentation has intensified lately. However, conventional data augmentation methods can hardly handle graph-structured data which is defined in non-Euclidean space with multi-modality. In this survey, we formally formulate the problem of graph data augmentation and further review the representative techniques and their applications in different deep graph learning problems. Specifically, we first propose a taxonomy for graph data augmentation techniques and then provide a structured review by categorizing the related work based on the augmented information modalities. Moreover, we summarize the applications of graph data augmentation in two representative problems in data-centric deep graph learning: (1) reliable graph learning which focuses on enhancing the utility of input graph as well as the model capacity via graph data augmentation; and (2) low-resource graph learning which targets on enlarging the labeled training data scale through graph data augmentation. For each problem, we also provide a hierarchical problem taxonomy and review the existing literature related to graph data augmentation. Finally, we point out promising research directions and the challenges in future research.

研究の動機と目的

GraphDA問題とその課題を形式的に定式化する。
拡張モダリティ別のGraphDA技術の包括的な分類を提案する。
低リソースグラフ学習および信頼性の高いグラフ学習におけるGraphDAの適用をレビューする。
学習可能なGraphDA手法のトレーニングパラダイム（分離学習、結合学習、二レベル最適化）を要約する。
今後のGraphDA研究の未解決課題と有望な方向性を強調する。

提案手法

拡張隣接行列および特徴行列をもつ tilde_G を生成する拡張変換 f_theta(G) を定義する。
拡張手法を構造指向、特徴指向、ラベル指向に分類する。
学習可能なGraphDAの3つのトレーニングパラダイム：分離学習、結合学習、二レベル最適化を説明する。
拡張損失 L_aug および L_utility とそれらの最適化における役割を導入する。
GraphDA技術とそれらをさまざまなDGLタスクに適用することの構造化された文献レビューを提供する。

実験結果

リサーチクエスチョン

RQ1グラフデータ拡張（GraphDA）とは何か、どのように形式的に定義できるか。
RQ2GraphDA技術は拡張モダリティ別にどのように体系的に分類できるか。
RQ3GraphDA技術は低リソースグラフ学習および信頼性の高いグラフ学習にどのように適用されるか。
RQ4下流のGNNタスクと統合するのに最も適したトレーニング戦略と目的は何か。

主な発見

これはGraphDAに特化した最初の調査であり、形式的定式化と統一されたレビューを提供する。
対象モダリティ（構造指向、特徴指向、ラベル指向）別のGraphDA技術の包括的分類が提示される。
低リソースグラフ学習（SSLおよび半教師あり学習）と信頼性の高いグラフ学習へのGraphDAの適用が分析される。
本論は分離学習、結合学習、二レベル最適化といったトレーニングパラダイムを論じ、未解決の課題と今後の方向性を概説する。
GraphDA技術を実践的なグラフタスクに結びつけ、効果的な拡張の設計原理を強調する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。