QUICK REVIEW

[论文解读] HyperRAG: Reasoning N-ary Facts over Hypergraphs for Retrieval Augmented Generation

Wen-Sheng Lien, Yu-Kai Chan|arXiv (Cornell University)|Feb 16, 2026

Advanced Graph Neural Networks被引用 0

一句话总结

HyperRAG 提出对 n 阶超图进行推理以进行检索增强生成，具备两种检索模式（HyperRetriever 和 HyperMemory）以降低语义碎片化和路径爆炸，并提升多步问答性能。

ABSTRACT

Graph-based retrieval-augmented generation (RAG) methods, typically built on knowledge graphs (KGs) with binary relational facts, have shown promise in multi-hop open-domain QA. However, their rigid retrieval schemes and dense similarity search often introduce irrelevant context, increase computational overhead, and limit relational expressiveness. In contrast, n-ary hypergraphs encode higher-order relational facts that capture richer inter-entity dependencies and enable shallower, more efficient reasoning paths. To address this limitation, we propose HyperRAG, a RAG framework tailored for n-ary hypergraphs with two complementary retrieval variants: (i) HyperRetriever learns structural-semantic reasoning over n-ary facts to construct query-conditioned relational chains. It enables accurate factual tracking, adaptive high-order traversal, and interpretable multi-hop reasoning under context constraints. (ii) HyperMemory leverages the LLM's parametric memory to guide beam search, dynamically scoring n-ary facts and entities for query-aware path expansion. Extensive evaluations on WikiTopics (11 closed-domain datasets) and three open-domain QA benchmarks (HotpotQA, MuSiQue, and 2WikiMultiHopQA) validate HyperRAG's effectiveness. HyperRetriever achieves the highest answer accuracy overall, with average gains of 2.95% in MRR and 1.23% in Hits@10 over the strongest baseline. Qualitative analysis further shows that HyperRetriever bridges reasoning gaps through adaptive and interpretable n-ary chain construction, benefiting both open and closed-domain QA.

研究动机与目标

为复杂的、多实体事实的检索增强生成（RAG）提供动机，因为二进制图难以表达。
提出 HyperRAG，以在 n 阶超图上运作，保持关系完整性并减少推理步数。
开发两种检索范式——HyperRetriever（结构-语义链学习）与 HyperMemory（LLM 指引的记忆），实现自适应检索。
在开放领域问答基准和封闭领域 WikiTopics CLQA 上评估 HyperRAG，以展示准确性和效率优势。

提出的方法

将 n 阶关系超图作为 RAG 的知识底层，以避免语义碎片化和路径爆炸。
HyperRetriever 使用一个 MLP 将主题实体和超边的结构信号与语义信号融合，构建与查询相关的关系链。
HyperRetriever 从 n 阶事实派生准二元三元组，应用方向距离编码（DDE）来表示结构近似性，并使用对比式 MLP 对三元组的可行性进行打分。
带阈值和密度自适应策略的自适应搜索，控制检索深度和广度。
HyperMemory 采用基于 LLM 的记忆来引导对超图的束搜索，给超边和尾部实体打分以形成候选路径。
一个考虑预算的上下文生成器将检索到的证据和查询上下文格式化，供 LLM 生成最终答案。

Figure 1 . Structural Comparison of (a) Knowledge Graphs and (b) Hypergraphs. For a given question $q$ , (a) requires 3-hop reasoning over binary facts, while (b) enables single-hop inference via an $n$ -ary relational fact, yielding a more compact and expressive multi-entity representation.

实验结果

研究问题

RQ1与二元知识图相比，n 阶超图表示是否可以提高检索保真度并减少 RAG 中的推理步数？
RQ2自适应的 HyperRetriever 与基于记忆的 HyperMemory 是否在开放领域和封闭领域问答中实现更高的准确性和效率？
RQ3在超边上的高阶推理如何影响 RAG 的可解释性和证据追溯？
RQ4HyperRetriever 与 HyperMemory 对问答性能和泛化的相对影响为何？
RQ5该方法对未见主题以及密集 vs 稀疏超图的泛化能力如何？

主要发现

HyperRAG 在 WikiTopics CLQA 上的平均提升为：回答准确性更高，MRR 提升 2.95%、Hits@10 提升 1.23%，相较最强基线的提升。
HyperRetriever 在开放域问答基准（如 HotpotQA、MuSiQue）中优于 HyperMemory，原因在于其可学习的可行性打分与自适应扩展。
在密集的跨域设置中，HyperRetriever 实现显著提升（如在 2WikiMultiHopQA 上相对 F1 提升 11.89%）。
在 11 个主题上的平均 MRR 提升为 2.95%、Hits@10 提升为 1.23%，相对于第二好基线具有统计显著性（p << 0.001）。
与 HyperRetriever 相比，HyperMemory 通常表现较弱，凸显了对 n 阶检索而言，显式的结构学习相对 LLM 参数记忆的价值。

Figure 2 . The overall framework of HyperRAG.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。