QUICK REVIEW

[论文解读] SPEAR: An Engineering Case Study of Multi-Agent Coordination for Smart Contract Auditing

Arnab Mallick, Indraveni Chebolu|arXiv (Cornell University)|Feb 4, 2026

Multi-Agent Systems and Negotiation被引用 0

一句话总结

SPEAR 引入一个五智能体 MAS 框架用于自治智能合约审计，包含计划、执行、修复、执行驱动和协调智能体，并在鲁棒性和对故障的抗性方面优于基线。

ABSTRACT

We present SPEAR, a multi-agent coordination framework for smart contract auditing that applies established MAS patterns in a realistic security analysis workflow. SPEAR models auditing as a coordinated mission carried out by specialized agents: a Planning Agent prioritizes contracts using risk-aware heuristics, an Execution Agent allocates tasks via the Contract Net protocol, and a Repair Agent autonomously recovers from brittle generated artifacts using a programmatic-first repair policy. Agents maintain local beliefs updated through AGM-compliant revision, coordinate via negotiation and auction protocols, and revise plans as new information becomes available. An empirical study compares the multi-agent design with centralized and pipeline-based alternatives under controlled failure scenarios, focusing on coordination, recovery behavior, and resource use.

研究动机与目标

展示将既有的 MAS 协同模式应用于现实安全工作流中的自治智能合约审计的方式。
开发一个具备风险感知的 Planning Agent，以应对信息变化时的审计优先级排序。
实现一个具有程序优先修复策略的 Repair Agent，以修复脆弱的生成产物。
展示一个协同框架，在故障情况下实现自治恢复和资源仲裁。
在协调性与恢复方面对 MAS 设计与集中式和流水线式替代方案进行实证比较。

提出的方法

将审计建模为由专门代理执行的任务：Planning、Execution、Repair、Safe Actuation 和 Coordinator。
对代理本地信念采用符合 AGM 的信念修正，以及对决策采用确定性策略。
通过 Plan Negotiation、Contract Net 和 Resource Auction 协议进行协调。
为修复脆弱的生成产物实现一个程序优先修复策略（PFIR）。
通过与 Damn Vulnerable DeFi 基准相比的实证实验对 SPEAR 与基线进行评估。
对 MAS 进行形式化建模（状态、事件、信念、目标和协议），并证明诸如无死锁、受限终止等性质。

Figure 1: SPEAR architecture: five specialized agents ( $A_{P}$ , $A_{E}$ , $A_{R}$ , $A_{C}$ , $A_{Coord}$ ) interact through a message bus, coordinating via Contract Net, plan negotiation, and resource auctions.

实验结果

研究问题

RQ1在相同分析逻辑下，MAS 协调结构如何影响审计任务的进展？
RQ2PFIR 自愈算法在从生成产物失败中恢复方面有多有效？
RQ3战略规划代理是否提高审计效率，尤其是一产出关键漏洞的时间？
RQ4在故障情境下，MAS组件（协议、自治性、自愈）如何提升系统鲁棒性？

主要发现

SPEAR 的整体有效性高于所有基线，包括集中调度器。
PFIR 始终能够从生成产物的失败中恢复，大多数修复在生成修复前是确定性的。
规划代理使高风险漏洞的早期发现成为可能，降低了首次发现关键漏洞的时间。
消融研究表明协议、自治性和自愈显著提升恢复时间并减少对大语言模型的调用。
完整 SPEAR 相较于缺少协议、自治性或自愈的配置，在恢复速度（2.3 分钟）和协调开销方面更优。
注入故障时，MAS 配置的方差较低、进展更稳定，与集中式基线相比表现更好。

Figure 2: Performance of the PFIR self-healing algorithm. The programmatic-first echelon resolves most failures, achieving consistent success across batches.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。