QUICK REVIEW

[論文レビュー] RAG-GNN: Integrating Retrieved Knowledge with Graph Neural Networks for Precision Medicine

Hasi Hays, William J. Richardson|arXiv (Cornell University)|Jan 31, 2026

Bioinformatics and Genomic Networks被引用数 0

ひとこと要約

RAG-GNNはグラフニューラルネットワークと生物医療文献からの検索補助知識を組み合わせ、精密医療における機能解釈と治療標的の特定を強化する。 topologyのみの手法と補完的な強みを示す。

ABSTRACT

Network topology excels at structural predictions but fails to capture functional semantics encoded in biomedical literature. We present a retrieval-augmented generation (RAG) embedding framework that integrates graph neural network representations with dynamically retrieved literature-derived knowledge through contrastive learning. Benchmarking against ten embedding methods reveals task-specific complementarity: topology-focused methods achieve near-perfect link prediction (GCN: 0.983 AUROC), while RAG-GNN is the only method achieving positive silhouette scores for functional clustering (0.001 vs. negative scores for all baselines). Information-theoretic decomposition shows network topology contributes 77.3% of predictive information, while retrieved documents provide 8.6% unique information. Applied to cancer signaling networks (379 proteins, 3,498 interactions), the framework identifies DDR1 as a therapeutic target based on retrieved evidence of synthetic lethality with KRAS mutations. These results establish that topology-only and retrieval-augmented approaches serve complementary purposes: structural prediction tasks are solved by network topology alone, while functional interpretation uniquely benefits from retrieved knowledge.

研究の動機と目的

ネットワークトポロジーと取得文献を統合して、生物医学予測における構造-機能ギャップに対処する動機付け。
トポロジーと外部知識を同時に符号化するRAG-GNNフレームワークを開発。
取得文書の予測寄与をトポロジーと比較して定量化。
がんのシグナリングネットワークでフレームワークを実証し、文献サポートを持つ治療標的（DDR1）を検証。

提案手法

GNNベースのトポロジー埋め込みと取得文書埋め込みを共通のセマンティック空間で結合するジョイント埋め込みフレームワークを開発。
意味的類似性と品質加重関連度スコアに基づいてノードごとにトップkの文献を選択する密なリトリーバを使用。
取得文書から注意機構を用いて文書埋め込みを用いた文脈化された知識ベクトルを計算。
構造埋め込みと意味埋め込みを連結またはゲート付きフュージョンで融合し、最終ノード表現を得る。
タスク損失、取得損失、コントラスト整列損失を組み合わせた多目的損失を最適化（L_total = L_task + lambda1 L_retrieval + lambda2 L_contrastive）。
リンク予測・機能的クラスタリング・ノード分類タスクの10種の埋め込み法とベンチマークを行い、情報理論的およびカウンターファクチュアル分析で取得知識内容を検証。

Figure 1: RAG-GNN framework for precision medicine: Architecture overview. The framework integrates six interconnected components for knowledge-augmented biomedical prediction. (1) Biological network input: heterogeneous molecular interaction networks representing protein-protein interactions, signa

実験結果

リサーチクエスチョン

RQ1 topologyのみと取得補助埋め込みは予測タスク全般で補完的な強みを示すか。
RQ2 ネットワークトポロジーを超えた取得知識の予測性能への定量的寄与はどれくらいか。
RQ3 取得文献は機能的解釈と標的同定を改善する独自かつ冗長性のない情報を提供できるか。
RQ4 現実世界の展開を模した時間分離検証でRAG-GNNはどう機能するか。
RQ5 設計容量を超えた取得知識が必要か、それとも同等容量のトポロジーのみモデルで同等性を達成できるか。

主な発見

トポロジーは予測情報の77.3%を寄与し、取得文献は独自に8.6%を寄与。
トポロジーのみの方法が失敗する領域で機能的クラスタリングに対して正のシルエットスコアを達成（0.001）。
がんのシグナリングネットワーク（379タンパク質、3498相互作用）でDDR1が治療標的として同定され、KRAS変異との合成致死性の取得証拠あり。
時系列検証でAUROCは0.891と、完全コーパスAUROCの0.912に近く、新規標的への一般化を示唆。
同容量のトポロジーのみGNN（取得なし）でAUROCは0.847、RAG-GNNは0.912で、容量を超えた取得知識の価値を支持。
カウンターファクトリトリーバル実験は、取得が低下すると性能が大幅に低下することを示し、取得内容の実質的な利益を示唆。

Figure 2: RAG-GNN architecture for precision medicine. The complete system integrates network topology encoding, knowledge retrieval, and context fusion through six main components. The forward pass (solid arrows) begins with the input network $\mathcal{G}^{(p)}=(\mathbf{A},\mathbf{X})$ representing

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。