QUICK REVIEW

[论文解读] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

Ivan Dobrovolskyi|arXiv (Cornell University)|Mar 23, 2026

Topic Modeling被引用 0

一句话总结

引入 RES，一种三层代理架构，将意图解析、确定性数据检索与叙事生成分离，实现 O(1) 令牌复杂度，并通过永不向 LLM 暴露原始记录来避免数据幻觉。

ABSTRACT

Large Language Models (LLMs) deployed as autonomous agents commonly use Retrieval-Augmented Generation (RAG), feeding retrieved documents into the context window, which creates two problems: the risk of hallucination grows with context length, and token cost scales linearly with dataset size. We propose the Reasoner-Executor-Synthesizer (RES) architecture, a three-layer design that strictly separates intent parsing (Reasoner), deterministic data retrieval and aggregation (Executor), and narrative generation (Synthesizer). The Executor uses zero LLM tokens and passes only fixed-size statistical summaries to the Synthesizer. We formally prove that RES achieves O(1) token complexity with respect to dataset size, and validate this on ScholarSearch, a scholarly research assistant backed by the Crossref API (130M+ articles). Across 100 benchmark runs, RES achieves a mean token cost of 1,574 tokens regardless of whether the dataset contains 42,000 or 16.3 million articles. The architecture eliminates data hallucination by construction: the LLM never sees raw records. KEYWORDS LLM agents; agentic architecture; hallucination elimination; token optimization; context window; retrieval-augmented generation; deterministic execution; scholarly metadata; Crossref API; O(1) complexity.

研究动机与目标

Motivate reducing hallucinations and token costs in LLM-driven autonomous agents that rely on large context windows.
Propose a three-layer architecture (Reasoner, Executor, Synthesizer) to separate concerns in data-intensive tasks.
Achieve constant token complexity with respect to dataset size while preserving accurate scholarly retrieval and synthesis.

提出的方法

提出三层 RES 架构，严格职责分离：Reasoner（意图解析）、Executor（确定性检索与聚合）、Synthesizer（叙事生成）。
Executor 使用零 LLM 令牌并向 Synthesizer 传递固定大小的统计摘要。
形式化证明 RES 在数据集规模方面实现 O(1) 的令牌复杂度。
在 ScholarSearch（Crossref API）上进行验证，覆盖 130M+ 篇文章。
在 100 次基准运行中进行实证评估，报告独立于数据集规模（42,000 至 16.3 百万篇文章）的稳定令牌成本。
通过构建方式证明消除了数据幻觉，因为 LLM 从不看到原始记录。

实验结果

研究问题

RQ1一个三层架构（Reasoner、Executor、Synthesizer）是否将令牌使用量绑定在相对于数据集规模的 O(1)？
RQ2通过 Executor 的确定性数据检索是否降低或消除了基于 RAG 的代理中的 LLM 幻觉？
RQ3在使用 RES 时，对极大语料库的学术检索任务的实际令牌成本是多少？

主要发现

RES 在 100 次运行中平均令牌成本为 1,574 令牌，与数据集规模无关（42k 与 16.3M 篇文章）。
Executor 向 Synthesizer 传输固定大小的统计摘要，避免 LLM 暴露于原始记录（降低幻觉风险）。
在 Crossref API 的 ScholarSearch 验证显示对大规模学术语料库的可扩展性。
根据形式分析，该架构实现了相对于数据集规模的 O(1) 令牌复杂度。
与上下文窗口规模相比，该方法在保持性能的同时显著降低了令牌成本。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。