QUICK REVIEW

[论文解读] Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents

Pengfei He, Ash Fox|arXiv (Cornell University)|Feb 2, 2026

Information and Cyber Security被引用 0

一句话总结

Co-RedTeam 是一个具备安全意识的多智能体框架，利用LLMs自动发现并利用软件漏洞，结合执行反馈与长期记忆，在安全基准上实现强大的利用与检测性能。

ABSTRACT

Large language models (LLMs) have shown promise in assisting cybersecurity tasks, yet existing approaches struggle with automatic vulnerability discovery and exploitation due to limited interaction, weak execution grounding, and a lack of experience reuse. We propose Co-RedTeam, a security-aware multi-agent framework designed to mirror real-world red-teaming workflows by integrating security-domain knowledge, code-aware analysis, execution-grounded iterative reasoning, and long-term memory. Co-RedTeam decomposes vulnerability analysis into coordinated discovery and exploitation stages, enabling agents to plan, execute, validate, and refine actions based on real execution feedback while learning from prior trajectories. Extensive evaluations on challenging security benchmarks demonstrate that Co-RedTeam consistently outperforms strong baselines across diverse backbone models, achieving over 60% success rate in vulnerability exploitation and over 10% absolute improvement in vulnerability detection. Ablation and iteration studies further confirm the critical role of execution feedback, structured interaction, and memory for building robust and generalizable cybersecurity agents.

研究动机与目标

推动自动化红队测试以大规模识别漏洞并减少对劳动密集型手工过程的依赖。
通过引入多智能体、基于执行的工作流来解决单次推理的局限性。
结合安全域扎根、代码感知分析与记忆，以模拟真实世界的红队工作流程。
实现迭代式规划、执行、评估，以及基于记忆的学习，以提高鲁棒性与泛化能力。

提出的方法

编排者协调两个阶段：漏洞发现与迭代利用，使用面向角色的工具与执行隔离。
阶段 I（漏洞发现）：分析与批判代理人进行基于代码的分析，结合 CWE/OWASP 扎根与代码浏览，生成有证据的漏洞草稿。
阶段 II（迭代利用）：规划者、验证者、执行者与评估者在沙箱环境中形成闭环，按真实执行反馈引导复现漏洞。
基于执行的循环把假设转化为具体行动，具有明确计划、验证门径，并基于执行结果进行迭代优化。
分层的长期记忆存储漏洞模式、策略记忆和技术行动，以实现经验复用与逐步改进。

Figure 1 : Overview of Co-RedTeam . Co-RedTeam is a security-aware multi-agent framework for automatic vulnerability discovery and exploitation. (Top) Given a target codebase (and optional vulnerability hints), the orchestrator coordinates two stages. Stage I ( Vulnerability Discovery): Analysis and

实验结果

研究问题

RQ1一个具备安全感知的多智能体框架是否在真实基准上优于单一代理LLM方法在漏洞发现与利用上的表现？
RQ2执行反馈、记忆与结构化交互对自动化红队任务的成功有何影响？
RQ3将漏洞发现与利用分离对自动化网络安全分析的准确性与鲁棒性有何影响？
RQ4长期记忆在多任务与代码库上的持续改进程度有多大？

主要发现

Co-RedTeam 在基准测试中实现了强大的利用与检测性能，超越使用静态分析或非执行推理的基线方法。
使用 Gemini-3-Pro，Co-RedTeam 在 CyBench 的利用成功率达到 63.7%，在 BountyBench 的利用率65.0%、检测率20.0%，在 CyberGym 的 ASR 为37.3%。
框架显示执行反馈、结构化交互与记忆是关键；删去任一组件都会显著降低性能（例如移除执行，CyBench 的利用率降至 17.5%。
长期记忆使随时间的记忆驱动性能提升，暖启动和演化记忆配置在早期阶段优于冷启动，并会随着经验持续改进。

Figure 2 : Effect of maximum exploitation iterations. Success rate on CyBench versus maximum exploitation iteration, illustrating how Co-RedTeam benefits from iterative planning.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。