QUICK REVIEW

[Paper Review] Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Linhao Luo, Zicheng Zhao | arXiv (Cornell University) | Oct 16, 2024

Advanced Graph Neural Networks | Cited by 5

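One-line summary

GCR introduces a KG-grounded decoding framework that constrains LLM reasoning with a KG-Trie index, achieving faithful, zero-hallucination reasoning on knowledge graphs and state-of-the-art KGQA performance.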

ABSTRACT

Large language models (LLMs) have demonstrated impressive reasoning abilities, but they still struggle with faithful reasoning due to knowledge gaps and hallucinations. To address these issues, knowledge graphs (KGs) have been utilized to enhance LLM reasoning through their structured knowledge. However, existing KG-enhanced methods, either retrieval-based or agent-based, encounter difficulties in accurately retrieving knowledge and efficiently traversing KGs at scale. In this work, we introduce graph-constrained reasoning (GCR), a novel framework that bridges structured knowledge in KGs with unstructured reasoning in LLMs. To eliminate hallucinations, GCR ensures faithful KG-grounded reasoning by integrating KG structure into the LLM decoding process through KG-Trie, a trie-based index that encodes KG reasoning paths. KG-Trie constrains the decoding process, allowing LLMs to directly reason on graphs and generate faithful reasoning paths grounded in KGs. Additionally, GCR leverages a lightweight KG-specialized LLM for graph-constrained reasoning alongside a powerful general LLM for inductive reasoning over multiple reasoning paths, resulting in accurate reasoning with zero reasoning hallucination. Extensive experiments on several KGQA benchmarks demonstrate that GCR achieves state-of-the-art performance and exhibits strong zero-shot generalizability to unseen KGs without additional training.

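Motivation and objectives

Bridge structured KG knowledge and unstructured LLM reasoning to obtain faithful, KG-aware answers
Eliminate hallucinations by constraining LLM decoding with the KG structure
Use a lightweight KG-specialized LLM together with a powerful general LLM for efficient, inductive reasoning over multiple paths
Achieve state-of-the-art KGQA performance with zero-shot generalization to unseen KGs
Demonstrate efficiency relative to retrieval-based and agent-based KG reasoning methods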

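Proposed method

Convert the KG into a KG-Trie that serves as a structured index of reasoning paths
Generate KG-grounded reasoning paths and hypothesis answers with the KG-specialized LLM, using graph-constrained decoding under the KG-Trie constraint
Fine-tune the lightweight KG-specialized LLM on graph-constrained decoding data
Feed multiple KG-grounded paths to the general LLM (FiD-style) for final inductive reasoning and answer generation
Apply the KG-Trie to unseen KGs without additional training for zero-shot generalization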
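
The first two steps above (indexing KG paths in a trie, then decoding only along trie branches) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each entity or relation is a single token, whereas the paper's KG-Trie works on the LLM's tokenizer output, and it replaces the KG-specialized LLM with a hypothetical `toy_score` function. The triples and the names `enumerate_paths`, `build_trie`, and `constrained_decode` are illustrative only.

```python
from collections import defaultdict

END = "<end>"  # sentinel: a complete reasoning path may stop at this node


def enumerate_paths(triples, start, max_hops):
    """List every path of 1..max_hops hops that starts at `start`."""
    adj = defaultdict(list)
    for head, rel, tail in triples:
        adj[head].append((rel, tail))
    paths = []

    def dfs(node, path, hops):
        if hops > 0:
            paths.append(tuple(path))
        if hops == max_hops:
            return
        for rel, tail in adj[node]:
            dfs(tail, path + [rel, tail], hops + 1)

    dfs(start, [start], 0)
    return paths


def build_trie(paths):
    """Store paths in a nested-dict trie keyed by token."""
    root = {}
    for path in paths:
        node = root
        for tok in path:
            node = node.setdefault(tok, {})
        node[END] = {}
    return root


def constrained_decode(trie, score_fn):
    """Greedy decoding where only children of the current trie node are allowed."""
    node, output = trie, []
    while True:
        allowed = list(node.keys())  # tokens that keep the path inside the KG
        tok = max(allowed, key=lambda t: score_fn(output, t))
        if tok == END:
            return output
        output.append(tok)
        node = node[tok]


# Toy KG (hypothetical data, not from the paper)
triples = [
    ("Alice", "born_in", "Paris"),
    ("Paris", "capital_of", "France"),
    ("Alice", "works_for", "Acme"),
]
paths = enumerate_paths(triples, start="Alice", max_hops=2)
trie = build_trie(paths)

# Stand-in for LLM next-token scores; "Mars" mimics a tempting hallucination.
PREFERENCE = {"born_in": 2.0, "capital_of": 2.0, "Mars": 9.0, END: 0.5}


def toy_score(prefix, token):
    return PREFERENCE.get(token, 1.0)


decoded = constrained_decode(trie, toy_score)
print(decoded)
```

With this toy scorer the decoder prints `['Alice', 'born_in', 'Paris', 'capital_of', 'France']`. The highest-scoring token, `Mars`, is never emitted because it is not a child of any trie node. This is the sense in which trie-constrained decoding can only produce paths that exist in the KG. A beam-search variant would keep the top K partial paths at each step instead of one.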

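Experimental results

Research questions

RQ1: Can GCR achieve state-of-the-art reasoning performance with favorable efficiency?
RQ2: Can GCR eliminate hallucinations and ensure faithful, KG-grounded reasoning?
RQ3: Can GCR generalize to unseen KGs in a zero-shot setting?
RQ4: What do the KG-specialized LLM and the general LLM each contribute?
RQ5: How do beam size and path length affect performance and efficiency?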

Main findings

Type	Methods	WebQSP Hit	WebQSP F1	CWQ Hit	CWQ F1
LLM Reasoning	Qwen2-0.5B	26.2	11.0	12.5	11.0
LLM Reasoning	Qwen2-1.5B	41.3	28.0	18.5	15.7
LLM Reasoning	Qwen2-7B	50.8	35.5	25.3	21.6
LLM Reasoning	Llama-2-7B	56.4	36.5	28.4	21.4
LLM Reasoning	Llama-3.1-8B	55.5	34.8	28.1	22.4
LLM Reasoning	GPT-4o-mini	63.8	40.5	63.8	40.5
LLM Reasoning	ChatGPT	59.3	43.5	34.7	30.2
LLM Reasoning	ChatGPT+Few-shot	68.5	38.1	38.5	28.0
LLM Reasoning	ChatGPT+CoT	73.5	38.5	47.5	31.0
LLM Reasoning	ChatGPT+Self-Consistency	83.5	63.4	56.0	48.1
Graph Reasoning	GraftNet	66.7	62.4	36.8	32.7
Graph Reasoning	NSM	68.7	62.8	47.6	42.4
Graph Reasoning	SR+NSM	68.9	64.1	50.2	47.1
Graph Reasoning	ReaRev	76.4	70.9	52.9	47.8
KG+LLM	KD-CoT	68.6	52.5	55.7	-
KG+LLM	EWEK-QA	71.3	-	52.5	-
KG+LLM	ToG (ChatGPT)	76.2	-	57.6	-
KG+LLM	ToG (GPT-4)	82.6	-	68.5	-
KG+LLM	EffiQA	82.9	-	69.5	-
GCR	(Llama-3.1-8B + ChatGPT)	92.6	73.2	72.7	60.9
GCR	(Llama-3.1-8B + GPT-4o-mini)	92.2	74.1	75.8	61.7

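GCR achieves state-of-the-art Hit on WebQSP and CWQ and outperforms every baseline in the table on both Hit and F1, reaching up to 92.6 Hit / 74.1 F1 on WebQSP and 75.8 Hit / 61.7 F1 on CWQ.
The KG constraints yield 100% faithful reasoning on both datasets, eliminating hallucinations in the tested cases.
In zero-shot experiments that transfer to unseen KGs, GCR outperforms ChatGPT and GPT-4o-mini on CSQA and MedQA.
After fine-tuning, a lightweight KG-specialized LLM (0.5B) can outperform larger models, which highlights the benefit of specialization.
Beam size and path length (K and L) balance exploration against reliability; K=10, L=2 offers a strong trade-off between performance and efficiency.
GCR stays efficient by decoding under the KG-Trie constraint, which allows parallel exploration without additional LLM calls.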
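
The review reports Hit and F1 without defining them. As a rough guide, the sketch below uses the set-based definitions common in KGQA evaluation: Hit is 1 when at least one gold answer is predicted, and F1 is the harmonic mean of precision and recall over answer sets. This is an assumption; the paper's evaluation script may differ, for example by matching answer strings inside generated text. The function names `hit` and `f1` are illustrative.

```python
def hit(predicted, gold):
    """Return 1.0 if any gold answer is among the predictions, else 0.0."""
    return float(bool(set(predicted) & set(gold)))


def f1(predicted, gold):
    """Set-based F1 between predicted and gold answers for one question."""
    pred, ref = set(predicted), set(gold)
    overlap = len(pred & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


# Hypothetical question with one gold answer and two predictions.
print(hit(["France", "Germany"], ["France"]))
print(round(f1(["France", "Germany"], ["France"]), 3))
```

In this example Hit is 1.0 but F1 is only about 0.667, because the extra wrong prediction halves precision. Under these definitions, a method can score high on Hit and noticeably lower on F1, as several rows in the table do. Benchmark-level scores are the per-question values averaged over the test set and multiplied by 100.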


This review was generated by AI and checked by a human editor.