QUICK REVIEW

[论文解读] Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems

Hehai Lin, Yu Yan|arXiv (Cornell University)|Mar 23, 2026

Advanced Graph Neural Networks被引用 0

一句话总结

该论文提出 Unified-MAS——一个离线两阶段框架，从外部知识中合成领域特定节点并通过困惑度引导的奖励进行优化，以提升知识密集型领域的 Automatic-MAS 性能，同时降低成本。

ABSTRACT

Automatic Multi-Agent Systems (MAS) generation has emerged as a promising paradigm for solving complex reasoning tasks. However, existing frameworks are fundamentally bottlenecked when applied to knowledge-intensive domains (e.g., healthcare and law). They either rely on a static library of general nodes like Chain-of-Thought, which lack specialized expertise, or attempt to generate nodes on the fly. In the latter case, the orchestrator is not only bound by its internal knowledge limits but must also simultaneously generate domain-specific logic and optimize high-level topology, leading to a severe architectural coupling that degrades overall system efficacy. To bridge this gap, we propose Unified-MAS that decouples granular node implementation from topological orchestration via offline node synthesis. Unified-MAS operates in two stages: (1) Search-Based Node Generation retrieves external open-world knowledge to synthesize specialized node blueprints, overcoming the internal knowledge limits of LLMs; and (2) Reward-Based Node Optimization utilizes a perplexity-guided reward to iteratively enhance the internal logic of bottleneck nodes. Extensive experiments across four specialized domains demonstrate that integrating Unified-MAS into four Automatic-MAS baselines yields a better performance-cost trade-off, achieving up to a 14.2% gain while significantly reducing costs. Further analysis reveals its robustness across different designer LLMs and its effectiveness on conventional tasks such as mathematical reasoning.

研究动机与目标

识别当前 Automatic-MAS 在知识密集型领域因缺乏领域先验知识而存在的局限性。
通过离线合成领域特定节点，将节点实现与拓扑编排解耦。
开发一个两阶段框架（基于搜索的节点生成和基于奖励的节点优化）以产生鲁棒的领域感知节点。
证明在多种专业基准上，整合生成节点能够提升性能并降低成本。

提出的方法

搜索驱动的节点生成：从任务样本中提取多维领域关键词，检索外部开放知识（Google、GitHub、Google Scholar），并从策略引导的提示中合成初步领域特定节点。
基于奖励的节点优化：将节点推理视为轨迹；使用困惑度为基础的奖励来评估并迭代改进瓶颈节点。
困惑度目标：J(Pθ,y,q,A_t) = -log(PPL(y|q,A_t))；J 越大表示可预测性越好。
计算每个节点的改进分数和一致性分数以形成节点质量分数 S_t，指导迭代优化。
通过在验证样本上的平均奖励识别瓶颈节点 v*，并在 K 个纪元内对其内部实现进行细化。
积分器动态：离线节点合成输入到现有 Automatic-MAS 基线，评估性能-成本权衡。

实验结果

研究问题

RQ1离线生成的领域特定节点是否能提升知识密集型领域中 Automatic-MAS 的性能？
RQ2困惑度引导的奖励优化能否可靠地增强合成节点的内部逻辑？
RQ3Unified-MAS 对不同设计师 LLM 是否鲁棒并且能否推广到非专业领域（如数学）？
RQ4在向现有 MAS 基线添加领域特定节点库时，对性能和成本有何影响？

主要发现

Unified-MAS 在四个专业基准上呈现稳定的性能提升，平均提升最高可达 14.2%。
将 Unified-MAS 的领域特定节点整合到基线中可降低成本。
该方法对不同的设计师 LLM（如 Gemini-3-Pro、Gemini-3-Flash、GPT-5-Mini）具有鲁棒性，并能推广到如数学等一般领域。
困惑度引导的优化逐步提升节点质量，在纪元中可识别的瓶颈得到改进。
无领域 grounding 的动态节点生成方法显示出不稳定性；Unified-MAS 提供稳定且领域 grounding 的改进。
定性分析表明 Unified-MAS 能创建结构化、类似专家的工作流（如司法工作流），而非表层的集成。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。