QUICK REVIEW

[論文レビュー] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

Ivan Dobrovolskyi|arXiv (Cornell University)|Mar 23, 2026

Topic Modeling被引用数 0

ひとこと要約

tldr: RESを導入する、意図解析、決定論的データ取得、物語生成を分離する三層エージェンティックアーキテクチャで、トークン複雑さをO(1)に抑え、LLMに生データを露出させないことでデータハルシネーションを排除。

ABSTRACT

Large Language Models (LLMs) deployed as autonomous agents commonly use Retrieval-Augmented Generation (RAG), feeding retrieved documents into the context window, which creates two problems: the risk of hallucination grows with context length, and token cost scales linearly with dataset size. We propose the Reasoner-Executor-Synthesizer (RES) architecture, a three-layer design that strictly separates intent parsing (Reasoner), deterministic data retrieval and aggregation (Executor), and narrative generation (Synthesizer). The Executor uses zero LLM tokens and passes only fixed-size statistical summaries to the Synthesizer. We formally prove that RES achieves O(1) token complexity with respect to dataset size, and validate this on ScholarSearch, a scholarly research assistant backed by the Crossref API (130M+ articles). Across 100 benchmark runs, RES achieves a mean token cost of 1,574 tokens regardless of whether the dataset contains 42,000 or 16.3 million articles. The architecture eliminates data hallucination by construction: the LLM never sees raw records. KEYWORDS LLM agents; agentic architecture; hallucination elimination; token optimization; context window; retrieval-augmented generation; deterministic execution; scholarly metadata; Crossref API; O(1) complexity.

研究の動機と目的

研究目的と動機: 大規模文脈ウィンドウに依存するLLM主導の自律エージェントのハルシネーションとトークンコストの削減を動機づける。
三層アーキテクチャの提案: data-intensive tasksの関心事を分離する（Reasoner, Executor, Synthesizer）。
データセットサイズに対して一定のトークン複雑性を達成しつつ、正確な学術的検索と統合を維持する。

提案手法

厳密な関心分離を伴う三層RESアーキテクチャの提案: Reasoner (intent parsing)、Executor (deterministic retrieval and aggregation)、Synthesizer (narrative generation) 。
ExecutorはLLMトークンをゼロ使用し、固定サイズの統計要約をSynthesizerに渡す。
データセットサイズに対するO(1)トークン複雑性を形式的に証明。
ScholarSearch (Crossref API)で13千万以上の記事を用いて検証。
100回のベンチマーク実行を経験的に評価し、データセットサイズに依存しない安定したトークンコストを報告（42,000～1630万記事）。
LLMが生データを一切見ない構造によりデータハルシネーションを排除していることを実証。

実験結果

リサーチクエスチョン

RQ1三層アーキテクチャ（Reasoner, Executor, Synthesizer）はデータセットサイズに対してトークン使用量をO(1)に制限できるか？
RQ2Executorによる決定論的データ取得はRAGベースのエージェントにおけるLLMハルシネーションを減少または排除できるか？
RQ3RESを用いた学術検索タスクにおける大規模コーパスでの実用的なトークンコストはどのくらいか？

主な発見

RESは100回の実行で平均トークンコスト1,574トークンを達成し、データセットサイズに依存せず（42k対1630万記事）
Executorは固定サイズの統計要約をSynthesizerに送信し、LLMが生データを露出せず（ハルシネーションリスクを低減）
Crossref APIを用いたScholarSearchで大規模な学術コーパスへのスケーラビリティを実証
形式的分析によりデータセットサイズに対してO(1)トークン複雑性を提供
コンテキストウィンドウスケーリングと比較して、トークンコストを大幅に削減しつつ性能を維持

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。